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This invention relates to E2 rraflj-activation repressors which interfere with normal functioning of the native full-length E2 
transcriptional activation protein of the papillomavirus. Native full-length E2 fra/Lf-activation protein activates transcription of 
papillomavirus only through binding to DNA, and it binds to DNA only in the form of a pre-formed homodimer -- a pair of 
identical polypeptide subunits held together by non-covalent interactions. The E2 /ranj-activation repressors of this invention are 
proteins, polypeptides or other molecules that dimerize with full-length native E2 polypeptides to form inactive heterodimers, 
thus interfering with the formation of active homodimers comprising full-length native E2 polypeptides, thereby repressing papil- 
lomavirus transcription and replication. The E2 /ra/is-activation repressors of this invention are advantageously used in the treat- 
ment of papillomavirus infections and their associated diseases. 
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REPRESSORS OF THE TRANS-ACTIVATING FUNCTION OF 
OF PAPILLOMAVIRUS E2 PROTEINS 



TECHNICAL FIELD OF INVENTION 

This invention relates to E2 trans -activation 
5 repressors which interfere with normal functioning of 
the native full-length E2 transcriptional activation 
protein of the papillomavirus. Native full-length E2 
trans ^activation protein activates transcription of 
papillomavirus only through binding to DNA, and it 

10 binds to DNA only in the form of a pre- formed 

homodimer — a pair of identical polypeptide subunits 
held together by non-covalent interactions. The E2 
trans -activation repressors of this invention are 
proteins, polypeptides or other molecules that dimerize 

15 with full-length native E2 polypeptides to form 
inactive heterodimers , thus interfering with the 
formation of active homodimers comprising full-length 
native E2 polypeptides, thereby repressing 
papillomavirus transcription and replication. The E2 

20 trans -activation repressors of this invention are 

advantageously used in the treatment of papillomavirus 
infections and their associated diseases. 

BACKGROUND ART 



25 



Papillomaviruses are a group of small DNA 
viruses that cause disease and pathological conditions 
in animals and humans. These tumorigenic viruses 
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produce benign tumors or lesions which may, in some 
instances, develop into malignancies. Papillomaviruses 
have been implicated as a cause of cervical cancer, as 
veil as other anogenital and epithelial malignancies. 
5 Papillomaviruses consist of icosahedral 

particles containing protein and a single, circular, 
double-stranded ONA molecule averaging 7.8 Jcbp. To 
date, more than ten animal papillomaviruses and more 
than fifty-five human papillomaviruses have been 
10 identified (R. Sousa et al., "Control of Papillomavirus 
Gene Expression", Biochimica et Biophvsica Acta. 1032, 
pp. 19-37 (1990); E.M. DeVilliers, "Heterogeneity of- 
the Human Papillomavirus Group", J. Virol,, 63, 
pp. 4898-903 (1989)). One particularly studied 
15 papillomavirus is bovine papillomavirus ("BPV") . 

All known papillomaviruses encode similar 
proteins that perform analogous functions in infected 
cells. The E2 transcriptional activation protein ("the 
E2 protein") is a Jaans-acting factor that activates 
20 transcription through specific binding to cis-acting E2 
enhancer sequences (i.e., E2 binding sites) in viral 
DMA (E.J. Androphy et al., "Bovine Papillomavirus E2 
Trjui-Activating Gene Product Binds to Specific Sites 
in Papillomavirus DNA", Nature . 324, pp. 70-73 (1987)). 
25 The 410 amino acid papillomavirus E2 protein has been 
shown to induce promoter expression in a classical 
enhancer mechanism (B.A. Spalholz et al., " trans - 
activation of a Bovine Papilloma Virus Transcriptional 
Regulatory Element by the E2 Gene Product", Cell . 42, 
30 pp. 183-91 (1985)). As with other transcription 
factors, the functions of E2 protein appear to be 
localized to discrete modular domains (I. Giri and 
M. Yaniv, "Structural and Mutational Analysis of E2 
2£&Iis-Activating Proteins of Papillomaviruses Reveals 
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Three Distinct Functional Domains 1 *, EMBQ J, , 7, 

pp. 2823-29 (1988)) . 

Papill mavirus infections are non-lytic in 
their natural hosts, indicating that transcription and 
5 replication of the papillomavirus are tightly 

controlled. An upstream regulatory region ("URR") is 
found immediately 5 f to the early genes of BPV and 
other papillomaviruses. The URR contains sis-acting 
regulatory signals, including an origin of DNA 
10 replication and several promoters that function in 
early gene transcription. The URR also contains 
enhancer elements that activate transcription from the 
URR promoters and heterologous promoters (Sousa et al., 
SMBEft) • 

15 The E2 enhancer elements are conditional, in 

that they stimulate transcription only when activated 
by a protein encoded by a papillomavirus E2 open 
reading frame ("ORF") . Gene products from the E2 ORF 
include the full-length transcriptional activator E2 

20 protein and at least two truncated versions of the E2 
protein in BPV1 that function as transcriptional 
repressors. Transcriptional activation and repression 
of viral genes by E2 gene products constitute critical 
regulatory circuits in papillomavirus gene expression 

25 and DNA replication. E2 genes and DNA binding sites 
for E2 gene products appear to be characteristic of all 
papillomaviruses, although placement of the binding 
sites may vary Id . 

Transcriptional regulation by the E2 protein 

30 depends on its direct binding to the nucleotide 

sequence 5 1 ACCXNNNNYGGT3 1 , which is found within cis - 

* 

acting E2 enhancer elements in all papillomaviruses 
(Androphy et al., supra : Dartmann et al., "The 
Nucleotide Sequence and Genome Organization of Human 
35 Papilloma Virus Type 11", Virology . 151, pp. 124-30 
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(1986); H. Hirochika t al. # "Enhancers and Trans* 
Acting E2 Transcriptional Factors of Papill maviruses", 
J. Virol. . 61, pp. 2599-606 (1987); P. Hawley-Nelson 
et al., "The Specific DNA Recognition Sequence of the 
5 Bovine Papillomavirus E2 Protein is an E2 -Dependent 
Enhancer", EMBO J. . 7, pp. 525-31 (1988); A. A. McBride 
et al., "The Car boxy-Terminal Domain Shared by the 
Bovine Papillomavirus E2 Transact iva tor and Repressor 
Proteins Contains a Specific DNA Binding Activity", 
10 EMBO J. , 7, pp. 533-39 (1988)). In that sequence, N 
represents any nucleotide; X is any nucleotide — but 
is usually 6, and Y represents any nucleotide — but is 
usually C. E2 binding sites appear to be positioned in 
close proximity to the viral promoters, with seventeen 
15 E2 binding sites being present throughout the bovine 
papillomavirus genome (R. Li et al., "Specific 
Recognition Nucleotides and their DNA Context Determine 
the Affinity of E2 Protein for 17 Binding Sites in the 
BPV-1 Genome", Genes Dev. . 3, pp. 510-26 (1989)). 
20 Enhancer elements containing E2 binding sites are found 
in the URR's of all papillomaviruses, as well as in 
other sites near promoters throughout the viral genome. 

E2 binding sites may function as an element 
in viral DNA replication, as well as a classical 
25 transcriptional enhancer element. E2-mediated DNA 
binding, therefore, is essential for the natural life 
cycle of papillomaviruses* 

European patent application 302,758 refers to 
the use of modified forms of E2 protein that bind to, 
30 and block, E2 binding sites on papillomavirus DNA 
without resulting in trans -activation . That 
application also refers to repression of E2 activation 
through the use of DNA fragments that mimic E2 binding 
sites, and thus bind with E2 trans -activators , making 
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them unavailable for binding to E2 sites n the viral 
DNA* 

E2 protein also binds the papillomavirus 
replication protein known as El. It has been propos d 
5 that when an E2/E1 complex binds to an E2 binding site, 
replication of the viral genome occurs (M. Botchan 
et al., International Papillomavirus Workshop, 
Heidelberg, Germany (May 1990); Mohr et al., "Targ ting 
the El Replication Protein to the Papillomavirus Origin 

10 of Replication by Complex Formation with the E2 

Transactivator" , Science r 250, pp. 1654-99 (1990)). 

Pull-length E2 transcriptional activator 
polypeptides (monomers) have a molecular weight of 
about 50 kD. Although amino acid sequence homology 

15 among E2 proteins of various papillomaviruses is low 
(ca. 35%), the E2 proteins share conserved motifs that 
constitute unique structural domains having distinct 
functions (Giri and Yaniv, supra ) . 

The C-terminal domain of the E2 polypeptid 

20 is responsible for recognition of E2 binding sites on 
viral DNA. The N-terminal domain of the E2 polypeptide 
is responsible for transcriptional activation following 
binding of the protein to viral DNA (A. A. McBrid 
et al., "E2 Polypeptides Encoded by Bovine 

25 Papillomavirus Type 1 Form Dimers Through The Common 
Carboxyl-Terminal Domain: trans -Activation is Mediated 
by the Conserved Amino Terminal Domain", Proc. Natl. 
Acad. Sci. USA , 86, pp. 510-14 (1989)). The E2 protein 
binds to viral DNA in vivo only in the form of a pre- 

30 existing homodimer I£. Dimeric E2 proteins exert 

control of papillomavirus promoters by directly binding 
to an inverted repeat that has been found in all such 
viruses . 

* 

In bovine papillomavirus models, and in some 
35 human papillomaviruses, at least two N-terminally 
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truncated E2 proteins occur naturally and act as native 
repressors. It has been experimentally c nfirmed in 
vitro that truncated forms of E2 pr teins which retain 
their ability to bind DNA but do not JaailS-activat / 
5 are competitive inhibitors of Jtcang-activation- 
competent E2 polypeptides (P.F. Lambert et al., "A 
Transcriptional Repressor Encoded By BPV-1 Shares A 
Common Car boxy-Terminal Domain With The E2 
Transactivator" , Cell , 50, pp. 69-78 (1987); 

10 A. Stenlund and M.R. Botchan, n The E2 Trans-Activator 
Can Act as a Repressor by Interfering with a Cellular 
Transcription Factor w , Genes Dev. , 4, pp. 123-36 
(1990); J. Choe et al., "Bovine Papillomavirus Type 1 
Encodes Two Forms of a Transcriptional Repressor: 

15 Structural and Functional Analysis of New Viral cDNAs", 
J. Virol. . 63, pp. 1743-55 (1989)). That inhibition 
has never been definitively attributed to competition 
for DMA binding sites, for E2 polypeptides in the 
dimerization process, or for both. It has been 

20 suggested that transcriptional repression occurs 

through direct competition with the native full-length, 
i.e., transcriptionally active E2 protein at the DNA 
binding site. PCT patent application W089/ 12461 refers 
to peptide inhibitors of viral gene expression and 

25 viral replication. Those inhibitors are said to bind 
to trans -activator binding sites in viral DNA, thus 
blocking normal binding of native trans -activating 
proteins to those sites. And it has been suggested 
that formation of non-functional protein complexes 

30 could also prevent E2 activation of transcription (P.F. 
Lambert et al . , supra \ . 

Although it is known that papillomavirus E2 
protein is the sequence-specific DNA binding protein 
that coordinates papillomavirus transcription, the 
35 structures of its DNA binding and dimerization motifs 
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hav never been determined. Both th DMA binding 
activity and the dim rizati n signal of the 
papillomavirus E2 trans-activation prot in reside in 
the carboxy terminal 100 amino acids of the prot in 
5 (McBride et al., supra) . The C-terminal 100, 125 or 
249„amino acids of E2 protein (each of which lacks 
jfccana-activation activity) all repress E2-dependent 
gene expression (T. Haugen et al., "Seguence-Sp cific 
and General Transcriptional Activation by the Bovin 
10 Papillomavirus-1 E2 Trans-Activator Require an 

N-Terminal Amphipathic Helix-Containing E2 Domain" , 
EH89 Jt , 7, pp. 4245-53 (1988)). Although the capacity 
for E2 dimerization, as well as the capacity for site- 
specific DHA binding, are known to reside in the c- 
15 terminal domain of the E2 polypeptide, the amino acid 
region within that domain responsible for E2 
dimerization has not been identified (Giri and Yaniv, 
SMBCa) . To date, the dimerization function of the E2 
polypeptide has not been separated from its DMA-binding 
20 function* Accordingly, repressors that inhibit 
papillomavirus transcription and replication by 
interfering with dimerization of native full-length E2 
polypeptides have remained unknown* r 

DISCLOSURE OF THE INVENTION 

By virtue of the present invention, the 
dimerization function of the E2 polypeptide has been 
separated from its DNA binding function. That 
separation has enabled, for the first time, the 
production of E2 trans act i vat ion repressor 
polypeptides that are homologous to papillomavirus E2 
polypeptides and which inhibit transcription and 
replication of papillomaviruses by interfering with 
dimerization of native full-length E2 polypeptides. 
These E2 icans -activation repressors advantageously 



25 



30 
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exert their anti-viral ffects by interfering with E2- 
protein-mediat d enhancement of papillomavirus 
transcription in cells infected with that virus. Th 
E2 trans -act i va ti on repressors of this invention ar 
5 characterized by their ability to form inactive E2 
heterodimers with full-length native £2 polypeptides 
produced by the papillomavirus and, therefore, to 
interfere with the formation of active homodimers by 
those polypeptides. By virtue of those abilities, the 
10 E2 trans -activation repressors reduce the availability 
of full-length native E2 polypeptides for formation of 
active homodimers, thus repressing papillomavirus 
transcription and replication. 

According to one embodiment of this 
15 invention, E2 tpans -activation repressors comprise at 
least the dimerization region, but less than the DNA 
binding domain, of the E2 polypeptide. Such 
repressors, which interfere with DNA binding by full- 
length E2 polypeptides through formation of inactive 
20 heterodimers and which comprise less than the DNA 
binding domain of the E2 polypeptide, by virtue of 
their reduced size, advantageously reduce the potential 
problem of repressor uptake into papillomavirus- 
infected cells. These repressors are useful in 
25 processes and compositions for treating papillomavirus 
infections. 

This invention also relates to methods for 
isolating mutations in DNA encoding polypeptides that 
are homologous to native E2 polypeptides and which form 
30 inactive heterodimers with native full-length E2 

polypeptides. Such mutations are useful in processes 
and compositions for the treatment of papillomavirus 
infections. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts the amino acid sequence of a 
segment of the wild type BPV1 E2 polypeptide between 
amino acids 285 and 410, which is the "DNA binding 
5 domain 11 . Rectangles (and abbreviations immediately 
above the rectangles) indicate changes made in that 
amino acid sequence to produce E2 mutants (homologues) / 
including the E2 trans -activation repressors of this 
invention. 

10 Figure 2 tabulates the DNA binding, 

dimerization and repression activities of various E2 
mutants prepared according to this invention. 

Figure 3 shows autoradiograms of 
electrophoresis gels from DNA binding ("gel shift") 

15 assays. 

Figure 4 shows autoradiograms of 
electrophoresis gels from a super-shift DNA binding 
assay performed with dimerization-defective mutant 
polypeptide 3 60S. 
20 Figure 5 schematically depicts the 

construction of plasmid pXB332. 

Figure 6 schematically depicts the 
construction of plasmid pXB323hGH. 

Figure 7 schematically depicts the 
25 construction of plasmid pXBlOl. 

Figure 8 schematically depicts the 
construction of plasmid pXB323. 

Figure 9 schematically depicts the 

# 

construction of plasmid pXB314. 
30 Figure 10 schematically depicts the 

construction of plasmids pEC337L, pEC339M, pEC340F, 
PEC340R, pEC340Y and pEC344L. 

Figure 11 schematically depicts the 
construction of plasmids pBG331 and pAHE2. 
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Figur 12 schematically depicts th 
construct i n of plasmid pET8c-85. 

Figure 13 schematically depicts the 
construction of plasmid pHE2-85. 
5 Figure 14 schematically depicts the 

construction of plasmid pET8c-123. 

Figure 15 schematically depicts the 
construction of plasmid pHE2-123. 

Figure 16 schematically depicts the 
10 construction of plasmid pFTESOl. 

DETAILED DESCRIPTION OF THE INVENTION 

In order that the invention herein described 
may be more fully understood, the following detailed 
description is set forth. 
15 In the description, the following terms are 

employed : 

E2 trans-Activation Repressor — Any protein, 
polypeptide or other molecule that interferes with 
transcriptional activation of papillomavirus resulting 
20 from binding of full-length native E2 polypeptides in 
the form of pre-existing homodimers to E2 binding sites 
on DNA. 

DNA Binding Domain — Amino acids 285-410 of 
BPVl E2 protein, or the corresponding homologous region 
25 of an HPV E2 protein* 

Native Minimal DNA Binding Domain — Amino 

acids 325-410 of BPVl E2 protein, or amino acids 

» 

283-365 of HPV16 E2 protein, each amino acid sequence 
being sufficient for dimerization and binding to E2 DNA 
30 binding sites. 

Homologous — An amino acid sequence very 
similar to at least a portion of the W DNA binding 
domain" but having at 1 ast one mutation therefrom, or 
a nucleic acid sequence encoding an amino acid sequence 
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very similar t at least a portion of the "DNA binding 
domain" but having at 1 ast one mutation therefrom. 

Homoloaue — A polypeptid or nucl ic acid 
that is homologous to a native E2 polypeptide or a 
5 native E2 gene, respectively. 

Mutant — Homologue or homologous. 
Mutation — A substitution, insertion or 
deletion in a gene encoding a desired protein or 
polypeptide. 

10 fl2 Protein Dimerization Region — The region 

of the DNA binding domain that is necessary and 
sufficient for dimerization but not sufficient for 
binding of the dimer resulting from that dimerizati n 
to DNA. 

15 Transport Moiety — Any covalent addition to 

an E2 trans -activation repressor that facilitates entry 
of that repressor into target cells. 

Inactive Heterodimers — Dimers that comprise 
two non-identical polypeptide subunits and which do not 
20 bind to E2 binding sites on DNA. 

Active homodimers — Dimers that comprise two 
identical E2 polypeptide subunits held together by non- 
covalent interactions and which cause transcriptional 
activation upon binding to E2 DNA binding sites. 
25 Reporter Gene — A gene whose expression 

depends on the occurrence of a cellular event of 
interest and can be conveniently observed in a 
genetically transformed host strain*. 

Re porter Plasmid — A plasmid vector that 
30 comprises one or more reporter genes. 

Reporter Strain — A genetically 
transformable unicellular host strain that comprises 
one or more reporter plasmids. 

Amino Acid — A monomer ic unit of a peptide, 
35 polypeptide or protein. The twenty protein amino acids 
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(L-isom rs) are: phenylalanine ("Phe" or "P") , leucine 
("Leu", "L"), isoleucin ("II " , "I") , methionin 
("Met", "M"), valine ("Val", "V") , serine ("Ser", "S") , 
proline ("Pro", "P") , threonine ("Thr", "T") , alanine 
5 ("Ala", "A") , tyrosine ("Tyr", "Y"), histidine ("His 11 , 
"H"), glutaaine ("Gin"/ "Q") , asparagine ("Asn", "N"), 
lysine ("Lys", "K") , aspartic acid ("Asp", *D"), 
glutamic acid ("Glu", "E") , cysteine ("Cys", "C") , 
tryptophan ("Trp", f, W") , arginine ("Arg", "R") and 
10 glycine ("Gly", W G") • 

As set forth in the examples of this 
application, E2 trans -activation repressors may be 
produced by random mutations and site-directed 
mutations in the C-terminal, 126-residue DNA binding 
15 domain of the E2 gene. Those mutations yielding 
functionally defective mutants or homologues of E2 
polypeptides may be isolated following transformation 
of a unicellular host strain carrying an E2 trans - 
activation reporter plasmid. The isolated mutations 
20 may then be analyzed in terms of: 

a) expression of a protein that is 
recognized by E2 antibodies and that has approximately 
the molecular weight (50 kD) expected for a full-length 
native E2 polypeptide; 
25 b) nucleotide sequence of the mutated 

region of the E2 gene ~ the region of the E2 gene that 
encodes the DNA binding domain — which is in the 
C-terminal region of the E2 polypeptide; 

c) capacity of the mutant (homologous) 
30 polypeptide corresponding to the C-terminal region of 

the native E2 polypeptide, to bind to E2 DNA binding 
sites; 

d) capacity of the mutant (homologous) 
polypeptide corresponding to th C-terminal region of 

35 the native E2 polypeptide, to dimerize with itself; 
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e) capacity of the mutant (homologous) 
polypeptide corresponding to the C-terminal regi n of 
the native E2 polyp ptid , to repr ss E2 -dependent 
trans -activation in eukaryotic cells. 
5 By virtue of this invention, for the first 

time, the E2 protein dimerization region — the regi n 
of the amino acid sequence in the E2 DNA binding domain 
that is responsible for dimerization independent of DNA 
binding interactions — was located between amino acids 

10 325 and 410 of the native E2 protein. In addition, it 
was recognized that the E2 protein dimerization region 
itself is sufficient to repress Jyans-activation by 
full-length E2 proteins and that DNA binding is not 
required for repression of E2 Jaana-activation • 

15 Further, it was discovered that several mutations in 
the E2 amino acid sequence abolish DNA binding without 
abolishing dimerization. 

According to one embodiment of this 
invention, an E2 trans -activation repressor comprises a 

20 polypeptide having an amino acid sequence homologous to 
the E2 DNA binding domain (SEQ ID N0:1), or homologous 
to a polypeptide fragment thereof, said polypeptide 
being capable of forming inactive heterodimers with the 
full-length native E2 polypeptides produced by the 

25 papillomavirus and said inactive heterodimers being 
incapable of binding to E2 DNA binding sites. 
Alternatively, an E2 icans-activation repressor of this 
invention consists essentially of a polypeptide having 
an amino acid sequence homologous to the native E2 DNA 

30 binding domain (SEQ ID N0:1), or homologous to a 

polypeptide fragment thereof, said polypeptide being 
capable of forming inactive heterodimers with the full- 
length native E2 polypeptide and said inactive 
heterodimers being incapable of binding to E2 DNA 

35 binding sites. 
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In an ther embodiment f this invention , an 
E2 trans -activation r pr ssor comprises a polypeptide 
that is a fragment of the native E2 DNA binding domain. 
Alternatively, an E2 trans -activation repressor of this 
5 invention consists essentially of a polypeptide that is 
a fragment of the native E2 DNA binding domain. 

In another embodiment of this invention, an 
E2 trans -activation repressor comprises an amino acid 
sequence selected from the group consisting of the 
10 amino acid sequence defined by SEQ ID NO: 3, the amino 
acid sequence defined by SEQ ID NO: 5, the amino acid 
sequence defined by SEQ ID NO: 7, the amino acid 
sequence defined by SEQ ID NO: 9, the amino acid 
sequence defined by SEQ ID NO: 11, the amino acid 
15 sequence defined by SEQ ID NO: 13, the amino acid 
sequence defined by SEQ ID NO: 15, the amino acid 
sequence defined by SEQ ID NO: 23, and the amino acid 
sequence defined by SEQ ID NO: 25 • E2 trans -activation 
repressors of this invention may also consist 
20 essentially of any one of those amino acid sequenc s. 

It should be understood that this invention 
also relates to E2 trains -activation repressors other 
than those defined by SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO: 13, SEQ ID 
25 N0:15, SEQ ID NO: 23 and SEQ ID NO: 25. More 

particularly, E2 trans -activation repressors according 
to this invention include polypeptides comprising 
fragments of the E2 DNA binding domain or amino acid 
sequences homologous to the E2 binding domain, so long 
30 as those polypeptides demonstrate the capacity to 
repress E2 trans -activation by interfering vith 
formation of active E2 homodimers. 

The E2 trans -activation repressors of this 
invention may be ch mically synth sized by conventional 
35 p ptide synthesis techniques, such as solid phase 
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synth sis (R.B. Merrifield, "Solid Phas Peptide 
Synth sis. I. Th Synth sis Of A T trapeptide" , J. Am. 
Chen. Soc . 83, pp. 2149-54 (1963)). Alternatively, 
they nay be prepared in appropriate hosts transformed 
5 with DNA sequences that code for the desired E2 
trans -activation polypeptide. For example, an E2 
trans -activation repressor of this invention may be 
prepared in a process comprising the steps of: 
a) culturing appropriate hosts that have been 

10 transformed with and which express a DNA sequence 
encoding that polypeptide; and b) recovering the E2 
trans -activation repressor from the culture. 

E2 trans -activation repressors according to 
this invention may also be produced by truncating a 

15 full-length native E2 gene, or a portion thereof, at 
various positions to encode a polypeptide that is a 
fragment of the native E2 binding domain and that 
comprises the E2 dimerization region, but lacks 
sequences necessary for DNA binding. For example, a 

20 papillomavirus E2 gene may be truncated so as to encode 
a polypeptide consisting of a sequence beginning 
between about amino acid 338 and amino acid 360, and 
ending at about amino acid 410. Such truncation of the 
full-length native E2 gene, or a portion thereof, may 

25 be accomplished by conventional techniques involving 
restriction digestion and oligonucleotide linkers, or 
by exonuclease digestion. A combination of such 
methods may also be employed to design E2 repressors 
other than those illustrated herein. 

30 When an E2 trans -activation repressor of this 

invention is produced by expression in a unicellular 
host transformed with a DNA sequence encoding the 
repressor, the DNA sequence should be operatively 
linked to an expression control sequence in an 

35 appropriate expression vector and employed in that 
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vector to transf rm an appropriate unic llular host. 
Such operativ linking of a DNA sequ nee encoding an E2 
trans -activation repressor of this invention to an 
expression control sequence, of course, includes th 
5 provision of a translation start signal in the correct 
reading frame upstream of that DNA sequence. If the 
particular DNA sequence to be expressed does not begin 
with a methionine, the start signal will result in an 
additional amino acid — methionine — being located at 

10 the N-terminus of the product. While such methionyl- 
containing E2 trans- activation repressors may be 
employed directly in the compositions and methods of 
this invention, it is usually more desirable to r move 
the methionine before use. Methods are available in 

15 the art to remove such N-terminal methionines from 

polypeptides expressed with them. For example, c rtain 
hosts and fermentation conditions permit removal of 
substantially all of the N- terminal methionine in vivo . 
Other hosts require in vitro removal of the N- terminal 

20 methionine. Such in vivo and in vitro methods are well 
known in the art. 

A wide variety of host /express ion vector 
combinations may be employed in expressing DNA 
sequences encoding the E2 trans -activation repressors 

25 of this invention. Useful expression vectors, for 
example, may consist of segments of chromosomal, non- 
chromosomal and synthetic DNA sequences, such as 
various known derivatives of SV40 and known bacterial 
plasmids, e.g., plasmids from E.coli , including col El, 

30 pCRl, pBR322, pMB9, pET-3A and their derivatives, wider 
host range plasmids, e.g., RP4, phage DNAs, e.g., the 

numerous derivatives of phage A, e.g., NM989, and other 
DNA phag s, .g., M13 and filamentous s ingle -stranded . 
DNA phages, yeast plasmids, such as the 2m plasmid or 
35 derivatives thereof, and vectors derived from combi- 
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nations of plasmids and phage DNAs, such as plasmids 
which hav be n modif i d t employ phage DMA or other 
expression contr 1 sequenc s. For animal c 11 
expression, ve prefer to use plasmid pJOD, which 
5 contains the adenovirus major late promoter augm nted 
by the presence of the SV40 enhancer (J. Barsoum, 
w Introduction of Stable High Copy Number DNA into 
Chinese Hamster Ovary Cells by Electroporation" , DNA 

and Cell Biol. , 9, pp. 293-300 (1990)). 

10 In addition, any of a wide variety of 

expression control sequences — sequences that control 
the expression of a DNA sequence when operative ly 
linked to it — may be used in these vectors to xpress 
DNA sequences encoding the E2 trans -activation 

15 repressors of this invention. Such useful expr ssion 
control sequences, include, for example, the early and 
late promoters of SV40, adenovirus or cyt omega 1 virus 
immediate early promoter, the X3Sc system, the trp 
system, the TAC or TRC system, T7 promoter whos 

20 expression is directed by T7 RNA polymerase, th major 

operator and promoter regions of phage A, the control 
regions for fd coat protein, the promoter for 3-phos- 
phoglycerate kinase or other glycolytic enzymes, the 
promoters of acid phosphatase, e.g., Pho5, the 

25 promoters of the yeast a-mating factors, the polyh dron 
promoter of the baculovirus system and other sequences 
known to control the expression of genes of prokaryotic 
or eukaryotic cells or their viruses, and various 
combinations thereof. For animal cell expression, w 

30 prefer to use an expression control sequence derived 
from the adenovirus major late promoter augmented by 
the presence of the SV40 enhancer. 

A wide variety of unicellular host cells are 
also useful in expressing DNA sequences encoding the E2 

35 tEans-activation repressors of this invention. These 
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hosts may include veil known eukaryotic and pr karyotic 
hosts, such as strains of E.coli , Pseudomonas . 
Bacillus , Strentomvces, Saccharomvces and other fungi, 
animal cells, such as Chinese hamster ovary ("CHO") and 
5 mouse cells in culture, African green monkey cells, 

such as COS 1, COS 7, BSC 1, BSC 40, and BMT 10, ins ct 
cells in culture, human cells in culture and plant 
cells in culture. For euiimal cell expression, ve 
prefer CHO cells. 

10 It should of course be understood that not 

all vectors and expression control sequences will 
function equally veil to express DNA sequences encoding 
the E2 trans -act i vat ion repressors of this invention. 
Neither will all hosts function equally veil with the 

15 same expression system. However, one of skill in the 
art may make a selection among these vectors, 
expression control sequences and hosts without undue 
experimentation and without departing from the scope of 
this invention. For example, in selecting a vector, 

20 the host must be considered, as the vector must 

replicate in it. The vector's copy number, the ability 
to control that copy number and the expression of any 
other proteins encoded by the vector, such as 
antibiotic markers, should also be considered. 

25 In selecting an expression control sequence, 

a variety of factors should also be considered. These 
include, for example, the relative strength of the 
. system, its controllability and its compatibility with 
the DNA sequence encoding the particular E2 

30 trans -activation repressor of this invention, 
particularly with respect to potential secondary 
structures. Unicellular hosts should be selected by 
consideration of their compatibility with the chosen 
vector, any potential toxicity of th product coded for 
35 on expression by the DNA sequences of this invention to 
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them, their seer tion charact ristics, their ability to 
fold prot ins correctly, their f mentation 
requires nts and the ase of purificati n of th 
products coded for on expression by ONA sequenc s 
5 encoding the particular E2 ££an£-activation repressor 
of this invention. 

The E2 trans -activation repressor 
polypeptides produced on expression of the DNA 
sequences of this invention may be isolated from the 
10 fermentation or animal cell cultures and purified using 
any of a variety of conventional methods. One of skill 
in the art may select the most appropriate isolation 
and purification techniques without departing from the 
scope of this invention. 
15 E2 trans-activation repressors according to 

this invention also include non-peptide chemicals ~ 
peptidomimetics — which are capable of specifically 
forming an inactive complex with native full-length E2 
polypeptides so as to prevent them from forming active 
20 homodimers and thereby blocking papillomavirus 

transcription and translation. And molecules that form 
a stable complex with E2 polypeptides so as to prevent 
them from forming active homodimers may be designed on 
the basis of 3 -dimensional data on the E2 dimerization 
25 domain. Three-dimensional data on the E2 dimerization 
domain may be obtained by X-ray crystallography. 

The structural motif represented in BPV1 by 

0 

amino acids 333 through 344 of the E2 protein is highly 
conserved among papillomaviruses, including human 
♦ 30 papillomaviruses (Giri and Yaniv, supra ) . Several 

papillomavirus E2 repressors of this invention comprise 
mutations in that highly conserved motif. It should be 
understood that the BPVl-derived E2 trans -activation 
repressors of this invention are useful in the 
35 tr atment of human papillomavirus infections. It 
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should be further under st od that the illustrativ 
processes for the production of E2 £cana-activation 
repressors from the bovine papillomavirus, BPVl, 
described in this application, may similarly be 
5 employed to produce E2 ^cans-activation repressors f r m 
human papillomaviruses, as demonstrated in Example 7. 

The processes and compositions of this 
invention may be used to treat any mammal, including 
humans. According to this invention, mammals are 
10 treated by the pharmaceutically acceptable 

administration of an E2 trans-activation repressor in a 
pharmaceutically effective amount and for a period of 
time sufficient to inhibit or lessen the spread of 
papillomavirus infection, to reduce the symptoms of the 
15 specific papillomavirus-associated disease, or to 
prevent their recurrence. 

Diseases which may be treated by the 
processes and compositions of this invention are those 
caused by the etiological agent, papillomavirus. Such 
20 diseases include, for example, epithelial malignancies, 
anogenital malignancies, such as cervical cancer, 
malignant lesions, benign lesions, papillomacarcinomas, 
papilloadenocystomas, papilloma neurophathicum, 
papillomatosis, cutaneous and mucosal papillomas, 
25 condylomas, oral, pharyngeal, laryngeal, and tongue 
papillomas, fibroblastic tumors and other pathological 
conditions associated with papillomavirus. The E2 
trans -activation repressors of this invention may also 
be used to treat epithelial and internal 
30 fibropapillomas in animals. In addition, the methods 
and compositions of this invention may be used for the 
recidivism prophylaxis of solid tumors. 

According to this invention, E2 trans - 
activation repressors may be in any pharmaceutically 
35 acceptable dosage form, including those which may be 
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administered intratum rally, per itumor ally, 
inter les ionally , intravenously , intramuscularly , 
subcutaneous ly r periolesionally, or by topical 
routes, to ^xert local therapeutic effects. 
5 Such dosage forms may include 

pharmaceutical ly acceptable carriers and adjuvants 
vhich are known to those of skill of the art. These 
carriers and adjuvants include, for example, ion 
exchangers, alumina, aluminum st ear ate, lecithin, serum 

10 proteins, such as human serum albumin, buffer 

substances, such as phosphates, glycine, sorbic acid, 
potassium sorbate, partial glyceride mixtures of 
saturated vegetable fatty acids, water, salts or 
electrolytes, such as protamine sulfate, disodium 

15 hydrogen phosphate, potassium hydrogenphosphate , sodium 
chloride, zinc salts, colloidal silica, magnesium 
trisilicate, polyvinyl pyrrolidone, cellulose-bas d 
substances and polyethylene glycol. Adjuvants for 
topical or gel base forms of E2 trans -activation 

20 repressors may, for example, be selected from th group 
consisting of sodium carboxymethylcellulose, 
polyacry lates , polyoxyethylene-polyoxypropy lene-block 
polymers, polyethylene glycol and wood wax alcohols. 
For all administrations, conventional depot forms may 

25 be used. 

The pharmaceutical compositions of this 
invention may be formulated using conventional methods 
to prepare pharmaceutically useful compositions. Such 
compositions preferably include at least one 

30 pharmaceutically acceptable carrier. See, e.g., 

Remington's Phar maceutical Sciences (E.W. Martin) . In 
addition, the compositions preferably include a 
pharmaceutically acceptable buffer, preferably 
phosphate buffered saline, together with a 

35 pharmaceutically acceptable compound for adjusting 
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isotonic pressure, such as, for xample, sodium 
chloride, mannitol or sorbitol. 

Pharmaceutical compositions according to this 

4 

invention may include one or more E2 trans -activation 
5 repressors as active ingredients. Alternatively, a 
composition containing one £2 trans -activation 
repressor may be administered to a patient in 
combination with, or sequentially with, a composition 
containing a different E2 t£2HS-activation repressor. 
10 The most effective mode of administration and 

dosage regimen of the E2 trans -activation repressor 
will depend upon the type of disease to be treated, the 
severity and course of that disease, previous therapy, 
the patient's health status and response to the E2 
15 repressor and the judgment of the treating physician. 
The E2 repressor may be administered to the patient at 
one time or over a series of treatments. 

According to one embodiment of this 
invention, papillomavirus-infected cells may be 
20 saturated with an E2 trans -activation repressor which 
forms inactive heterodimers with the native full-length 
E2 polypeptides produced by that virus, to interfer 
with the formation of active homodimers comprising 
native full-length E2 polypeptides, thus repressing 
25 viral transcription and replication. 

Depending on the severity of the 
papillomavirus infection or its associated disease, for 
parenteral regimens, a dose of between about 1 and 1000 
mg/kg body weight of the E2 trans -activation repressor 
30 may be administered to the patient, via one or several 
administrations, or released from a depot form per 
treatment- Alternatively, a dose of between about 1 
and 1000 ng/tol of the E2 trans -activation repressor may 
be administered to a patient per application via 
35 topical routes. 
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Acc rding to an alternate embodiment of this 
invention, an E2 irana-activation repr ssor may be 
administered s rially or in combination with other 
therapeutics used in the treatment of papillomavirus 
5 infections or diseases caused by them. Such 

therapeutics include interferons, such as IFN—7, IFN-a 
and IFN-fi derived from natural sources or produced by 
recombinant techniques, other cell mediators form d by 
leukocytes or produced by recombinant techniques such 

10 as for example, inter leukin-1, inter leukin-2 , tumor 
necrosis factor, macrophage colony stimulating factor, 
macrophage migration inhibitory factor, macrophage 
activation factor, lymphotoxin and fibroblast growth 
factor. Alternatively, the E2 trans -activation 

15 repressor may be administered serially or in 

combination with conventional therapeutic agents or 
regimens such as, for example, salicylic acid, 
podophyllotoxin, retinoic acid, surgery, laser therapy 
and cryotherapy. Such combination therapies may 

20 advantageously utilize less than conventional dosages 
of those agents, or involve less radical regimens, thus 
avoiding any potential toxicity or risks associated 
with those therapies. 

The E2 trans -activation repressors of this 

25 invention may be delivered to papillomavirus-infected 
cells either directly or indirectly. Direct delivery 
of E2 trans -activation repressors may be facilitated by 
chemical modification of the polypeptides themselves. 
One such modification involves increasing the 

30 lipophilicity of the E2 trans -act ivat ion repressor in 
order to increase binding to the cell surface, in turn, 
stimulating non-specific endocytosis of the prot in. 
Lipophilicity may be increased by adding a lipophilic 
moiety ( .g. , one or more fatty acid mol cules) to the 

35 E2 repressor. A wide vari ty of fatty acids may be 
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employed. F r example, the protein may be 
palmitoylated. Alt rnatively, a lipopeptid may be 
produced by fusion or cross -linking, to permit the E2 
repressor to resemble the natural lipopeptide from 
5 E.coli , tripalmitoyl-S-glycerylcysteil-seryl-serin , at 
its amino terminus. This lipopeptide has been shown to 
increase the uptake of fused peptides (P. Hoffmann 
et al., "Stimulation Of Human And Murine Adherent Cells 
By Bacterial Lipoprotein And Synthetic Lipopeptid 

10 Analogues", Immunobiol . . 177, pp. 158-70 (1988)). 

Lipophilicity may also be increased by ester if icat ion 
of the protein at tyrosine residues or other amino acid 
residues. And uptake of the E2 £cans-activation 
repressor may be increased by addition of a basic 

15 polymer such as polyarginine or poly lysine (W-C, Sh n 
and H.J.P. Ryser, "Conjugation Of Poly-L-Lysine Albumin 
And Horseradish Peroxidase: A Novel Method Of 
Enhancing The Cellular Uptake Of Proteins", Proc. Natl, 

Acad. Sci USA , 75, pp. 1872-76 (1978)). 

20 Because some uptake mechanisms for E2 trans- 

activation repressors may involve passage through 
lysosomes and since long half-life in the target cells 
is desirable, an E2 trans -activation repressor of this 
invention may be modified to increase its protease 

25 resistance and, in turn, the half-life of the 
polypeptide in circulation and cells. In one 
embodiment of the present invention, the protease 
resistance of an E2 trans -activation repressor is 
increased by incorporation of D-amino acids instead of 

30 L-amino acids at some or all residues of the 
polypeptide. In another embodiment , the amino 
terminus, or carboxy terminus, or both termini of an E2 
repressor are blocked by chemical modification. In a 
further embodim nt of this invention, lysosomal 

35 proteas s are inhibited by an E2 trans-activation 
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r pressor in a c mposition comprising a lysomotrophic 
agent, such as chloroquin , amantadine, monensin, 
methylamine, or ammonium chloride. 

Direct delivery of E2 Jaamft-activation 
5 repressors according to this invention may also be 
effected by the use of transport moieties, such as 
protein carriers known to cross cell membranes. For 
example, an E2 trans -act i vat ion repressor may be fused 
to a carrier protein, preferably by a genetic fusion 

10 which may be expressed in a system such as E.coli or 
yeast. According to one embodiment of this invention, 
the amino terminus of the E2 trans -activation repressor 
may be fused to the carboxy terminus of a transport 
moiety using standard techniques. 

15 Nucleotide sequences encoding such carrier- 

E2 t ^ans -act i va t i on repressor fusion proteins, 
operatively linked to regulatory sequences/ may be 
constructed and introduced into appropriate expression 
systems using conventional recombinant DNA procedures. 

20 The resulting fusion protein may then be purified and 
tested for its capacity to (1) enter intact eukaryotic 
cells and (2) inhibit E2 -dependent gene expression and 
viral DNA replication once inside the intact eukaryotic 
cells. 

25 In choosing a useful carrier protein, those 

of skill in the art will recognize the desirability of 
appropriate control experiments designed, inter alia, 
to test the possibility that the carrier portion of the 
fusion protein itself interacts with elements of the E2 

30 transcriptional regulation system. If the carrier 
portion of the fusion protein is found to have 
undesirable interactions, such as activation of E2- 
dependent enhancer elements, the portions of the 
carrier sequence responsible for these interactions 

35 should be identified and deleted in a way which permits 
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the sequence to retain its carri r capacity. 
Alternately, on f s veral conventional carrier 
sequences which do not interact with elements of the E2 
transcriptional regulation system can be substituted. 
5 Useful carrier proteins include, for example, 

bacterial hemolysins or "blending agents", such as 
alamethicin or sulfhydryl activated lysins. Other 
carrier moieties which may be used include cell entry 
components of bacterial toxins, such as Pseudomon^s 

10 exotoxin, tetanus toxin, ricin toxin, and diphtheria 
toxin. Also useful is melittin, from bee venom. Other 
useful carrier proteins include proteins which are 
viral receptors, cell receptors or cell ligands for 
specific receptors that are internalized, i.e., those 

15 which cross mammalian cell membranes via specific 

interaction with cell surface receptors, recognized and 
taXen into the cell by cell surface receptors. Such 
cell ligands include, for example, epidermal growth 
factor, fibroblast growth factor, transferrin and 

20 platelet-derived growth factor. Alternatively, the 
ligand may be a n on -pep tide, such as mannose-6- 
phosphate, which permits internalization by the 
mannose-6-phosphate receptor. The transport moiety may 
also be selected from bacterial immunogens, parasitic 

25 immunogens, viral immunogens, immunoglobulins or 
fragments thereof that bind to target molecules, 
cytokines, growth factors, colony stimulating factors 

* 

and hormones. A transport moiety may also be derived 
from the tat protein of HIV-l. 

30 As an alternative or addition to the above- 

described chemical modifications and protein carriers, 
which may be employed alone or in combination, other 
agents which allow penetration of the keratinized cell . 
layer may be employed to facilitate delivery of the E2 

35 trans -activation repressors of this invention to 
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papillomavirus-infect d cells. In t plcal 
applications , for exampl , the E2 £ranfi-activation 
repressor may be administered in combination with 
dimethy Isulf oxide , an agent which promotes penetrati n 
5 of cell membranes by substances mixed with it. Useful 
keratinolytic agents include, for example, salicylic 
acid, urea, and a-hydroxyacids . For such applications, 
the E2 trans -activation repressor and any other ag nt 
may be administered topically, in cream or gel form. 

10 Indirect delivery of an E2 jaains-activation 

repressor to papillomavirus- infected cells may be 
carried out by delivering a gene encoding an E2 trans- 
activation repressor, with appropriate expression 
control sequences, into those cells. A gene encoding 

15 an E2 trans -act i vat ion repressor may be introduced into 
target cells by treating the infected cells, for 
example, by scraping them to allow uptake of DNA, by 
electroporation , by direct injection, or through the 
use of defective recombinant viruses, such as 

20 retroviruses. For example, a DNA sequence encoding an 
E2 trans -activation repressor may be introduced into 
target cells using a retrovirus by transcribing the DNA 
sequence encoding an E2 trans -activation repressor into 
an RNA sequence and incorporating the resulting RNA 

25 sequence into a defective recombinant retrovirus. 

In order that the invention described herein 
may be more fully understood, the following examples 
are set forth. It should be understood that these 
examples are for illustrative purposes only and are not 

30 to be construed as limiting this invention in any 
manner. Throughout these examples, all molecular 
cloning reactions were carried out according to methods 
in T. Maniatis et al., Molecular Cloning; A Laboratory 
Manual . Cold Spring Harbor Laboratory (1982) r 

35 J. Sambrook et al., Molecular Cloning - A Laboratory 
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Manual . Cold Spring Harbor Laboratory (1982) or 
J. sambrook et al., Molecular Cloning - A Laboratory 
Manual. 2nd Ed. , Cold Spring Harbor Press (1989) , using 
enzymes obtained from Hew England Biolabs (Beverly, 
5 Massachusetts) , except where otherwise noted. We 

confirmed the integrity of all plasmid constructions by 
DMA sequencing. 

EXAMPLE 1 

Chemical Mutagenesis. Phenotypic Selection Of 
10 Mutants And Site-Directed Mutagenesis 

We cloned the full-length coding strand of 
the wild type BPV1 E2 gene from plasmid pC0-E2 (Hawley- 
Nelson et al. , su pra ) into the filamentous single- 
stranded DMA bacteriophage M13 strain mpl8 (Life 

15 Technologies, Inc., Gaithersburg, MD) and isolated 

single-stranded DNA for chemical mutagenesis of the E2 
protein DNA binding domain. See Chapter 4 of Sambrook 
et al., supra , for standard procedures pertaining to 
the use of bacteriophage M13. 

20 In summary, in order to generate a large 

number of mutants, we chemically mutagenized and 
reverse-transcribed one strand of the BPVl E2 gene, 
transferred the double-stranded segment into a wild 
type E2 yeast expression vector and isolated mutants 

25 that were limited in gene induction. Random mutations 
in the DNA encoding the 126-residue C-termirial DNA 
binding domain of the E2 protein were produced by 
chemical mutagenesis, essentially according to the 
method of R.M. Myers et al., "A General Method for 

30 Saturation Mutagenesis of Cloned DNA Fragments", 

Science , 229, pp. 242-47 (1985). The method of Myers 
et al., supra , involves brief exposure of single- 
stranded DNA to chemicals such as nitrous acid, formic 
acid, hydrazaine, or potassium permanganate, that 
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damage all four bases without damaging the 
ph sphodiester backbone of the DNA. 

More sp cifically, ve tr ated 20 ng of 
single-stranded M13 DNA containing the full-length E2 
5 gene with 1.3 mM potassium permanganate for between 
about 5 and 10 min. In a variation of the proc dure, 
we treated the single-stranded DNA with 12 M formic 
acid for between about 5 and 10 min. With either 
chemical reagent, we carried out the reaction at ro m 

10 temperature and stopped it by addition of 1/10 volume 
of 2.5 H sodium acetate (pH 7.0). We separated the 
chemically modified, single-stranded DNA from th 
reaction mixture by precipitating it twice with cold 
ethanol in the presence of yeast tRNA carrier. We 

15 further purified the chemically modified DNA by agarose 
gel electrophoresis. 

For second strand synthesis, we annealed 
synthetic oligonucleotide primers complementary to the 
single-stranded DNA 3' to the BstXl site which is at 

20 nucleotide 3881 in the 3 f non-coding region of the BPV1 
E2 gene. Any portion of this region may be used for 
priming, and the exact length of the primer is not 
critical — so long as the primer is of a sufficient 
length to form a stable duplex. Conditions for 

25 annealing of primers and techniques of primer extension 
are well known in the art. We used a primer having the 

sequence 5 1 AGCAACTAGTCCCAAG 3», (SEQ ID NO: 17) which 

is complementary to nucleotides 3904 to 3919 of BPVl. 
For primer extension, we used T7 polymerase (Sequenase 

30 2.0, U.S. Biochemicals, Cleveland, Ohio). The primer 
extension reaction was carried out at 37 °C for about 
1 hr, in the presence of all four dNTPs, according to 
the vendor *s recommendations. Alternatively, we used 
murine leukemia virus reverse transcriptase (Life 

35 Technologies, Inc., Gaithersburg, MD) at 40°C for 
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primer extension. Wh n the polymerase us d for s cond 
strand synth sis encountered a damaged base in the 
template strand, it incorporated any one of the four 
dNTPs. Random transitions and trans vers ions, involving 
5 all four bases, were, therefore, likely to be produced 
at potentially any point in the nucleotide sequence. 
Thus, synthesis of the complementary DMA strand led to 
mutation at sites on the coding strand where chemical 
reaction took place. 

10 We digested the primer extension products 

(i.e., double-stranded DNA) with restriction 
endonucleases Kpnl and BstXl to release a population of 
426 bp randomly mutagenized E2 gene fragments encoding 
the C-terminal region of the E2 protein. 

15 We purified the mutagenized fragments on an 

agarose gel and subcloned them into the wild type E2 
gene in yeast expression vector pYE2 (Morrissey et al., 
supra ) , replacing the corresponding wild type 426 bp 
KpnI-BstXI fragment with a mutagenized fragment. 

20 Plasmid pYE2 comprises a galactose upstream activating 
sequence ("GAL UAS W ) and downstream restriction sites 
such that the GAL UAS can conveniently be used to drive 
expression of the homologous E2 sequences. Gene 
expression from GAL UAS is induced by the presence of 

25 galactose but strongly repressed by glucose. Thus, 
expression of the E2 sequences and E2 homologous 
sequences may be regulated by choice of yeast cultur 
medium. In addition, piasmid pYE2 contains a ura gene, 
a selectable marker that permits growth of hosts on 

30 selective media lacking uracil. 

Figure 1 depicts the sequence of amino acids 
285-410 of the wild type BPV1 E2 protein (E.Y. Chen 
et al., "The Primary Structure and Genetic Organization 
of the Bovine Papillomavirus Type 1 Genome" , Nature . 

35 299, 529-34 (1982)). This region of the BPV1 E2 
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prot in is known as th DNA binding domain (Giri and 
Yaniv, sacra) • Figure 1 als depicts chang s in the 
amino acid s quence of th BPVl E2 DNA binding domain 
which characterized various E2 transcription repressor 
5 mutants prepared according to this invention. As shown 
in Figure 1, mutations were scattered throughout the 
DNA binding domain. Several of the dimerization 
defective E2 mutants were characterized by two or three 
nucleotide alterations from the native E2 protein 
10 sequence • 

The cysteine at position 340 of the wild type 
E2 protein in Figure 1 is present in all papillomavirus 
E2 proteins whose sequences are known. Furthermore, 
DMA binding activity of E2 protein is dependent on the 

15 presence of reducing agents. Accordingly, we generated 
mutations at position 340 in order to determine the 
criticality of that cysteine. We used site-directed 
mutagenesis to substitute the three other nucleotid s 
for G and for C in the TGC codon for cysteine 340 in 

20 the BPVl E2 gene. We performed the site-directed 

mutagenesis according to the method of Kunkel et al., 
"Rapid and Efficient Site-Specific Mutagenesis without 
Phenotypic Selection- , Methods Enzymol. 154, pp. 367- 
82 (1987). 

25 Figure 2 identifies various E2 trans - 

activation repressor mutants prepared according to this 
invention and summarizes their DNA binding, 
dimerization and repression activities, as assayed in 
the following examples. 

30 EXAMPLE 2 

Screening Mutati ons Bv trans-activation Of A Reporter 
Gene In Yeast 

We tested the E2 mutants for trans - 
activation of a reporter gene in an E2 trans -activation 
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report r strain* More specifically, we used pYE2, 
carrying the p pulati n of E2 mutations, to transform 
reporter strain BGW1-7A, a yeast strain which contained 
yeast reporter plasmid pBY-4 (Morr issey et al . , supra ) . 
5 Plasmid pBY-4 contains a B-galactosidase reporter g ne 
under the control of an E2 -dependent promoter, plus a 
Leu2 gene, which serves as a selectable marker. 
Expression of the B-galactosidase gene in pBY-4 was 
rendered E2 dependent by having it under the control of 
10 an appropriately placed cyc-1 minimal promoter preceded 
upstream by four E2 binding sites. We then selected 
for transf ormants that contained both plasmids on a 2% 
glucose/minimal medium without leucine and uracil. 
Transfer of cells to glucose minimal medium lacking 
15 leucine and uracil provided strong positive selection 
pressure for transf ormants harboring both plasmid pYE2 
and plasmid pBY-4, while leaving expression of E2 
sequences uninduced. 

Although not required, a step involving such 
20 selection pressure without expression of E2 sequences, 
immediately following transformation, is preferred, in 
order to exclude the possibility that expression of E2 
protein or E2 homologues might confer a selective 
disadvantage that would discriminate against the 
25 desired transf ormants in a mixed population. 

The colonies selected were then replica 
plated onto selective yeast minimal medium containing 
100 mH potassium phosphate (pH 6.9), 2% galactose, and 
0.004% X-gal. X-gal is a colorless B-galactosidase 
30 substrate (5-bromo-4-chloro-3-indolyl fl-glucoside) that 
yields a blue product upon cleavage by B-galactosidase. 
Galactose induced expression of the E2 gene carried on 
pYE2 and X-gal gave a color indication of E2 -dependent 
activity of the B-galactosidase gene carried on plasmid 
35 pBY-4. On media containing galactose and X-gal, 
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transformants xpr ssing trans-activating E2 horn loguei 
were blue, whil colonies expressing n n-activating 
h mologues wer light blue or whit . After incubating 
the cultures for 48 hours at 30°C, we visually assayed 
5 colony color, on a scale of 1-8 • Approximately 10-15% 
of the colonies were white or light blue. All E2 
mutants listed in Figure 2 were originally isolated as 
white or light blue colonies. White colonies harbored 
£Ean&-activation-abolishing mutations, while light blue 

10 colonies harbored £cans-activation-reducing mutations. 
Mutants 366Y/376L, 386W, 360S, 3991, 408*, 411*, and 
3SLI were isolated as light blue colonies in the 
initial screening. The other mutants listed in 
Figure 2 were isolated as white colonies. D?u:k blue 

15 colonies harbored either unmutated E2 sequences or 
E2 mutations that did not reduce E2 trans -activation , 
and thus were discarded. 

The E2 plasmid, pYE2, was isolated from each 
mutant clone and the mutagenized E2 insert of each 
20 clone was sequenced by standard methods. 

As detailed above in Example 1, the five 
mutants at cys340 did not arise from our screen, but 
were generated by site-directed mutagenesis. 

EXAMPLE 3 

25 Expression Of E2 Protein And E2 Homoloaues 

In Yeast And In E.coli 

We analyzed the selected light blue or white 
transformants for expression of full-length E2 proteins 
as follows. Since mutations resulting in premature 

30 termination codons or unstable E2 proteins were not 

* 

desired, we extracted total protein from cultures of 
the selected light blue and white colonies and tested 
that protein by standard immunoblot techniques. Only 
mutant clones that produced nearly wild-type levels of 
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protein that reacted with E2 antibodies and that also 
had a molecular weight of ab ut 50 JcD were further 



First, we cultured each of the selected light 
5 blue or white transformants in 50 ml of selective 
minimal medium containing 2% galactose, for 7 hrs at 
30*C. To extract E2 proteins, we harvested the cells 
by centrifugation and washed them with protein 
extraction buffer (200 mM Tris/HCl (pH 8.0), 400 mM 

10 ammonium sulfate, 10 mM magnesium chloride, 1 mM EDTA 
and 10% glycerol (v/v) ) • We then suspended the washed 
cells in 2 volumes of protein extraction buffer 
supplemented with 5 mM DTT and the following protease 
inhibitors: 1 mM PMSF, ImM TLCK, pepstatin and 5mM 

15 benzamidine hydrochloride. 

After addition of washed 0.45 mm diameter 
glass beads (about equal in volume to the yeast cell 
pellet) , we disrupted the yeast cells by vigorous 
vortexing 6 times, for 30 sec. each time. Heavy 

20 insoluble debris was removed by a first round of 
centrifugation and the supernatant clarified by 
centrifugation at about 13,000 x g for 1 hr at 4°C. He 
added an equal volume of cold, saturated ammonium 
sulfate solution to the clarified supernatant and 

25 allowed proteins to precipitate on ice for 15 min. The 
precipitated proteins were pelleted by centrifugation 
in a fixed angle JA-17 rotor at 13000 x g for 10 
minutes at 4°C and then dissolved in 50 til of 
solubilization buffer (25 mM Tris/HCl (pH 8.0), 2 mM 

30 EDTA, 20% glycerol (v/v) , 1 mM DTT and the same mixture 
of protease inhibitors used in the protein extraction 
buffer. 

We expressed the O terminal 126 amino acids 
(plus an N-terminal methionine residue) of the BPV1 E2 
35 polypeptide and the corresponding mutant polypeptides 
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(homologues) in th E.coli expression v ctor pET8C, as 
d scribed by Studi r, "Use of T7 RNA Polymerase to 
Direct Expr ssion of Cloned Gen s", Methods Bnzymol. , 
185, pp. 60-90 (1990), after creating a Kpnl sit 
5 immediately 3' to the Ncol site and AT6 codon. Mutant 
E2 sequences were then transferred into pET8C-E2 as 
KpnI-BstXl fragments. The pET8C-E2 expression vectors 
were induced to express the E2 homologues as described 
by Studier ( supra ) , following transformation of the 

10 expression host E.coli strain BL2l(DE3)pLYSS (Studier, 
su pra ) . We produced 50 ml cultures of the transformed 
expression host strain grown under inducing conditions 
and harvested the cells by centrifugation. 

The harvested cells were suspended in 4 ml of 

15 20 mM MES (pH 6.0) containing 1 mM PMSF and subjected 
to freeze-thaw lysis. Insoluble debris was removed by 
centrifugation. The E2 polypeptides and E2 homologues 
were partially purified by chromatography on 
S-Sepharose (Pharmacia-LKB, Piscataway, NJ) . We 

20 applied the protein solution to a 0.2 ml column of 
S-Sepharose that had been pre-equilibrated with the 
freeze-thaw lysis buffer. The column was washed with 
20 mM MES (pH 6.0), 100 mM NaCl, 5 mM DTT and 
1 mM EOT A. E2 polypeptides and E2 homologues wer then 

25 eluted from the column with a solution containing 20 mM 
MES (pH 6.0), 600 mM NaCl, 5 mM DTT, 1 mM EDTA, 10% 
glycerol (v/v) , imM PMSF, 1 nH pepstatin, 2 fig/ml 
leupeptin and 2 ng/.ml aprotinin. As described below, 
we then tested for the presence E2 homologues in the 

30 eluate by conventional immunoblot procedures, which may 
be carried out by those of ordinary skill in the art 
using standard techniques. 

We resolved proteins in the eluate according 
to molecular weight by SDS polyacrylamide gel 
35 electrophoresis. Following electrophoresis, we 
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transferred the resolved prot ins onto nitrocellulose 
membranes by standard blotting techniques. He then 
treated the nitrocellulose membranes with bovin s rum 
albumin to saturate non-specific protein binding sit s 
5 on the membrane and exposed the membrane to polyclonal 
rabbit anti-E2 serum at a serum dilution of 1:2500, for 
2 hours, at room temperature. After washing the 
membrane to remove unbound antibodies, ve visualized 
antibodies bound to electrophoretic protein bands via 
10 alkaline phosphatase conjugated to antibodies that bind 
to rabbit immunoglobulins. 

PWV Binding A?say? 

In order to determine which mutant E2 

15 polypeptides (homologues) bound to E2 DNA binding 
sites, we carried out DNA binding assays. 

We first mixed between about 0.5 and 4.0 nl 
of partially purified E2 polypeptides or homologues, at 
a concentration of about 1 ng/Ml (prepared as described 

20 in Example 3 above) with about 1.5 ng of poly dl-dC and 
about 300 ng of sheared salmon sperm DNA, in a total 
volume of about 20 pi, for 10 min. at 4°C. We then 
added between about 0.5 and 2.0 ng of end- label led DNA 
fragments (about 10,000 cpm/reaction) containing one, 

25 two or four E2 DNA binding sites and placed the mixture 
on ice. The DNA fragments (probes) containing E2 
binding sites consisted of Nsil restriction fragments 
from pBY-1 (one E2 binding site) , pBY-2 (two E2 binding 
sites), or pBY-4 (Morrissey et al., supra ) . After 30 

30 minutes, we added 1/10 volume of 20 mM Hepes (pH 7.5), 
20% glycerol (v/v) and 0.25% bromophenol blue to the 
DNA-protein mixture, for electrophoresis. We then 
resolved DNA-protein complexes from unbound DNA and 
protein by electrophoresis in 4-5% polyacrylamide gels 
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f r about 3-4 h urs, at 150 v. The electrophoresis 
buffer was 0.5 x TBE. Following electrophoresis, gels 
were dried and exposed to X-ray film. Our DNA binding 
assay was in accord with well-known methods (see 
5 generally: F. Ausubel et al., "Mobility Shift DNA 
Binding Assay Using Gel Electrophoresis 9 *, in Current 

(1988)). 

Figure 3 shows autoradiograms of 

10 electrophoresis gels from thes.e DNA binding "gel shift" 
assays. As depicted in that figure, the sample in each 
lane included radioactive DNA probe containing E2 
binding sites. The DNA in panels A and B contained two 
E2 binding sites, and the DNA in panel C contained 4 

15 DNA binding sites. Sample designations are as follows: 
"P", radioactive DNA probe containing E2 binding sites, 
in the absence of added protein; "wt", native E2 
repressor; "333", mutant polypeptide 333V; "337", 
mutant polypeptide 337L; "339", mutant polypeptide 

20 339M; "344", mutant polypeptide 344L; "360", mutant 
polypeptide 360S; "316", mutant polypeptide 316Y; 
"370", mutant polypeptide 3701; "340R", "340F", "340Y", 
"340S", "340G" and "3SLI" refer to mutant polypeptides 
having those designations. The basis of the assay is 

25 that protein bound to the DNA probe slows 

electrophoretic migration of the DNA. Thus, binding of 
protein to the DNA causes the DNA band to be "shifted" 
from its electrophoretic position observed in the 
absence of boUnd protein. Figure 3 shoWs that mutant 

30 polypeptides 333V, 337L, 339M, 344L, 340F, 340R, 340Y 
and 360S did not "shift" the electrophoretic position 
of the DNA probe, and thus they did not bind to the E2 
binding sites on the DNA probe. Mutants 333V, 337L, 
339M, 344L, 340R, 340Y, 340F, 403 all failed to stably 

35 bind the E2 DNA element in this gel-shift assay. 
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T further characterize hom logues that were 
diner ization def ctive, we performed super-shift DNA 
binding assays, using monoclonal antibodies to BPVl E2 
protein. The super-shift assays were carried out to 
5 determine whether dimerization defective E2 homologues 
would bind to E2 DMA binding sites when held together 
in pairs by monoclonal antibodies , to simulate 
dimerization. In the super-shift assays, we first 
incubated the E2 homologues on ice with between about 2 

10 to 4 Ml of culture medium (DulBecco's modified medium 
with 10% fetal calf serum) from a monoclonal antibody- 
producing hybridoma cell culture, for 30 min., before 
addition of labelled DMA. We then placed the mixture 
on ice. After 30 min., we added 1/10 volume of 

15 20mMHepes (pH 7.5), 20% glycerol (v/v) and 0.25% 

bromophenol blue to the DNA-protein-antibody mixture, 
for electrophoresis. Electrophoresis was as in the DMA 
binding assay described above. 

Figure 4 shows autoradiograms of 

20 electrophoresis gels from a super-shift DMA binding 
assay performed with dimerization-defective mutant 
polypeptide 3 60S. In that figure, Gel A shows that in 
the absence of anti-E2 monoclonal antibody, mutant 
polypeptide 360S did not bind to DMA probes having 1, 2 

25 or 4 E2 binding sites. Gel B shows that in the 
presence of anti-E2 monoclonal antibody, however, 
mutant polypeptide 3 60S did exhibit binding to DMA 
probes having 2 or 4 E2 binding sites. Gel A samples 
were as follows: "IP", DNA probe with one E2 binding 

30 site, in the absence of added protein; "lA", DNA probe 
with one E2 binding site, in the presence of mutant 
polypeptide 3 60S; "IB", DNA probe with one E2 binding 
site, in the presence of native E2 repressor; "2P n , DNA 
probe with two E2 binding sites, in the absence of 

35 added protein; "2A", DNA probe with two E2 binding 
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sites, in the presence of mutant polyp ptid 360S; 
"2B", DNA probe with two E2 binding sites, in th 
presence of native E2 repressor; "4P W , DNA probe with 
four E2 binding sites, in the absence of added protein; 
5 H 4A", DNA probe with four E2 binding sites, in the 
presence of mutant polypeptide 360S; n 4B", DNA probe 
with four E2 binding sites, in the presence of native < 
E2 repressor. Gel B samples were as follows: "IP", 
DNA probe with one E2 binding site, in the absence of 

10 added protein; "lA", DNA probe with one E2 binding 
site, in the presence of mutant polypeptide 3 60S and 
monoclonal antibody; W 1B W , DNA probe with one E2 
binding site, in the presence of native E2 repressor 
and monoclonal antibody; w 2P tt , DNA probe with two E2 

15 binding sites, in the absence of added protein; W 2A M , 
DNA probe with two E2 binding sites, in the presence of 
mutant polypeptide 360S and monoclonal antibody; M 2B", 
DNA probe with two E2 binding sites, in the presence of 
native E2 repressor and monoclonal antibody; "4P W , DNA 

20 probe with four E2 binding sites, in the absence of 
added protein; W 4A W , DNA probe with four E2 binding 
sites, in the presence of mutant polypeptide 3 60S and 
monoclonal antibody; ^4B", DNA probe with four E2 
binding sites, in the presence of native E2 repressor 

25 and monoclonal antibody. 

In the initial screening of mutants for loss 
of E2 trans -activation , we noted that while mutant 360S 
was unable to activate the promoters with one or two E2 
' binding sites, trans-activation was approximately 40% 

30 of native E2 protein control levels in assays involving 
four E2 binding sites. In DNA binding assays with DNA 
probes having one or two E2 binding sites, mutant 
polypeptide 3 60S had practically no DNA binding 
activity, but in assays with a DNA probe having four E2 

35 binding sites, 360S bound a small fraction — 
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approximately 1% of the DNA pr be. This suggested that 
the 3 60S mutati n might r tain slight residual 
diner ization activity. 

We complemented the dimerization defect f 

5 mutant polypeptide 360S with a monoclonal antibody, 
with the two antibody "arms" holding two 3 60S monomers 
in close proximity, to simulate dimerization. 
Monoclonal antibody (Mab) B202, whose epitope is 
immediately upstream from the DNA binding domain, or 

10 Mab B201, whose epitope maps further upstream between 
amino acids 160 and 220 of E2 protein, were includ d 
with the 360S polypeptide and DNA the probe. While 
monoclonal antibodies are preferred, polyclonal 
antibodies prepared by conventional techniques may also 

15 be employed in super-shift assays. The presence of Mab 
202 restored practically complete binding of 
polypeptide 3 60S to DNA probes having two or four E2 
binding sites. Mab 202 did not restore binding of 360S 
to the probe having only one E2 binding site (Figur 

20 4). Mab 201 was only 5-10% as effective in restoring 
binding of mutant polypeptide 360S to E2 binding sites 
(data not shown) . This was predictable, since the 
epitope of Mab 201 was further from the E2 DNA binding 
domain, which contains the E2 dimerization region, than 

25 was the Mab 202 epitope. We therefore expected Mab 201 
to be less efficient at holding the 3 60S dimerization 
regions together than was Mab 202. To exclude the 
, possibility that the monoclonal antibody binding 
restored DNA binding by altering the conformation of 

30 the 360S polypeptide, we performed super-shift assays 
on 360S with normal dimeric B202 antibodies, which bind 
two E2 polypeptides (or homologues) , and monomer ic B202 
antibody fragments, which bind only a single E2 
polypeptide, and therefore do result in simulated E2 

35 dimers. While the normal dimeric Mab 202 allowed 3 60S 
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to bind to DNA, th m nomeric form of Mab 202 did not 
rest re binding of 3 60S to DNA in gel shift DNA binding 
assays. Separate controls confirmed that th monomer ic 
form of Mab 202 did bind to E2 polypeptides. These 
5 data strongly support our belief that E2 monomers 
cannot bind to E2 DNA binding sites. 

Since the E2 binding domain has no primary 
sequence homology to that of any other known 
transcription factor, the amino acids of E2 responsible 

10 for DNA binding interactions ( M DNA contact subdomain") 
were unknown prior to the instant invention. 

Four of the mutant polypeptides of this 
invention, 333V, 337L, 339M and 344L, were isolated 
from a twelve amino acid span from positions 333 to 344 

15 of E2 protein, which is a highly conserved region of 
that protein among all papillomaviruses. All were 
isolated as white colonies on the initial screen 
(Example 2) . All failed to stably bind the E2 DNA 
element by gel-shift assay. Replacement of glycine 333 

20 with valine also prohibited dimer formation (see 

Example 5), but since the other mutations (337L, 339M, 
344L) existed as pre-formed dimers, we inferred that 
this region was responsible for DNA interactions. 
These latter three mutations altered the positively 

25 charged amino acids glutamine, lysine and arginine, 
recognized to be involved in protein nucleic acid 
interactions. 333V also appeared to be 
transcriptionally distinct from other dimerization 
defective mutants (described below) which demonstrated 

30 activity with four E2 binding sites, while 333V did 

not. The high conservation of glycine at this position 
suggested that it is critical for proper tertiary 
folding of the C- terminal portion of E2 protein. 

Comparison of the amino acid sequence of the 

35 region (amino acid residues 333 to 344) of the native 
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E2 polypeptide t the DMA binding domain of other 
transcripti n fact rs failed to rev al similarities to 
the helix-tum-helix, helix-loop-helix, homeodomain, 8- 
sheet, or zinc finger classes of DMA binding domains. 
5 This region of the E2 polypeptide includes several 
basic amino acids and no acidic residues, yet bears 
virtually no primary sequence homology to the basic 
region of the jun/fos family of transcription factors, 
which has been shown to be required for their DMA 
10 binding capability. In common with these, however, 

this E2 domain also contains a central cysteine (amino 
acid 340) • 

♦ 

The 340R, 340Y and 340F E2 mutations, in 
which cysteine 340 was replaced with arginine, tyr sine 

15 or phenylalanine, respectively, had comparable 
characteristics to the mutations isolated in this 
region by chemical mutagenesis and phenotypic 
selection. These failed to trans -activate the E2 
dependent promoter with one, two or four E2 elements, 

20 and were likewise defective for DNA binding by gel 

shift (Figure 3). These cysteine mutants were able to 
dimerize (see Example 5) . These data suggest that 
substitution of bulky amino acids at cysteine 340 
blocked DMA interactions, not through inhibition of 

■» 

25 protein-protein interactions, but through 

destabilization of the DNA contact subdomain. 

EXAMPLE 5 

In Vitro Dimerization Assay 

In order to determine which mutant E2 
30 polypeptides retained the capacity to form dimers, 

subunits of dimeric E2 proteins were covalently bound 
by standard cross-linking reactions* Reaction 
conditions were adjusted so that the covalent bonding 
between subunits of pre-existing dimers occurred 
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r adily, with minimal covalent bonding between 
m nomers. F llowing th cross-linking reaction, the 
standard t chnique of sodium dod cyl sulfate 
polyacrylamide gel electrophoresis ("SDS-PAGE") , which 
5 separates proteins on the basis of size, was empl yed 
to determine which E2 mutations yielded polypeptid s 
that formed dimers. Wild type E2 protein dimerizes in 
the absence of DNA. 

For use in the cross-linking reactions, we 

10 prepared crude extracts of E2 homologues from cultures 
of yeast clones (as described in Example 3, supra 1 . We 
then carried out cross-linking by exposing samples from 
those yeast crude extracts to an ultraviolet (354 run) 
light box, for 30 sec, at a distance of 1 cm. 

15 Following the cross-linking reactions, we 

prepared the protein samples for SDS polyacrylamide gel 
electrophoresis by adding SDS to a final concentration 
of 3% and A-mercaptoethanol to a final concentration of 
5%. We then heated the samples and maintained them at 

20 65°C for about 3 min. We used a 9% polyacrylamide gel 
to resolve proteins. Following electrophoresis, we 
transferred the resolved proteins onto nitrocellulose 
sheets using standard electroblotting techniques. W 
then detected mutant E2 monomers and cross-linked 

25 dimers (at about 50 kD and about 100 kD, respect iv ly) 
by immunoblot using polyclonal antisera to BPVl E2 
protein. 

The results of these assays are set forth in 
Figure 2. As shown in that figure, the dimerization- 

30 preventing mutations mapped from amino acid position 
360 to at least amino acid position 402 of E2 protein, 
with the dimerization region potentially extending 
almost to the end of the polypeptide (i.e., position 
410). Mutation 360S was the only single amino acid 

35 change that removed all dimerization activity. Th 
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360S mutation interestingly alt red a highly conserved 
tryptophan r sidue shar d among the papillomaviruses. 
Analyses by UV cross-linking of 360S utilizing yeast 
and bacterial expression vectors demonstrated the 
5 inability of this tryptophan point mutant to form 
diners. Mutant 360S was a very poor repressor, 
strongly supporting our belief that the dimerization 
function is required for repression of papillomavirus 
trans -activation and viral replication. Dimer ization- 

10 defective mutants, 3SLI and 402* likely had intact DNA 
binding domains, since like mutant 360S, 3SLI and (to a 
lesser extent) 402* could super-shift — they could 
bind DNA and give a shift in a band retardation assay 
in the presence of a monoclonal antibody that 

15 recognizes the DMA domain binding of E2 protein (see 
Example 4), We believe that this super-shift activity 
resulted from the E2 mutant polypeptides being held 
together as a simulated dimer by the antibody. Mutants 
such as 402* , which have small insertions or deletions, 

20 may have failed to dimer ize due to gross perturbations 
in protein folding. Thus, it is not clear whether the 
region of that mutation is directly involved in 
dimerization. 

E2 Dimerization Function 

25 We have partially characterized a previously 

published (Haugen et al., supra ) E2 mutant, 3812i 
(called 402* herein) f which is DKA binding-defectiv 
and inactive in our yeast trans -activation assay 
system, with even four E2 binding sites. Mutant 402* 

30 has an in-frame insertion of 4 amino acids at position 
402. Analysis of the biochemistry of this mutant 
revealed that it does not dimerize in vitro and does 

■ 

not bind to E2 binding sites on DNA in DNA binding (gel 
shift) assays. However, 402* can be complemented for 
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DMA binding with a polyclonal anti-E2 serum in a super- 
shift assay. These results suggest that the £2 protein 
dimerization region itself spans, or is affected by, 
the region from amino acids 360 to 402 of E2 protein. 
5 Other mutations in the E2 protein 

dimerization region were Isolated as light blue 
colonies on our initial screening. Mutants 3SLI and 
366Y/376L displayed intermediate levels of 
transcriptional activation, mutant 3991 converted a C- 

10 terminal methionine to isoleucine and this had slightly 
reduced ability to activate the E2 dependent promoters. 
Mutant 386W replaced a highly conserved arginine with 
tryptophan and was also found to be partially defective 
for promoter activation. Biochemical characterization 

15 of these reduced activation mutants demonstrated DNA 
binding activity in gel shift assays (Example 4) and 
formed dimers in vitro . In the repression studies, 
mutant proteins 386W and 3991 were efficient repressors 
(see Example 6) . These mutant polypeptides dimerize 

20 and bind DNA. Mutant protein 3SLI had a reduced level 
of E2 transcriptional repression. Biochemical studies 
demonstrated that it has decreased dimerization 
capability, but not as defective as the 360S mutant 
protein. Mutations 408* and 411* affected the 3' 

25 terminus of E2. While this segment of E2 is not highly 
conserved among the papillomaviruses, the loss of 
dimerization activity upon insertion of four amino 
acids at position 402 revealed the requirement of this 
region for dimerization. Nonetheless, both 408*, which 
, 30 had alteration of the last 3 amino acids and added an 
additional 8 residues, and 411*, which had the 
translational stop codon replaced with a leucine codon, 
resulting in an extra 22 Oterminal amino acid 
residues, retained £rans_-activation function in large 

35 part. Consistent with its defective dimerization, 
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mutant polypeptide 402* was not able to repress £2 
trans -activation. Mutant polypeptides 408* and 411* 
both can bind ONA and dimerize as 126 amino acid forms 
purified from the £x£fili expression host. 408* was a 
5 weak repressor, but 411* failed to repress. It is not 
clear why 411* does not repress £2 trans -activation , 
but ve believe that it may have reduced ability to form 
inactive heterodimers with full-length £2 proteins due 
to the 22 amino acid peptide fused to its carboxyl 
10 terminus. 

These genetic and biochemical analyses 
suggest that the region of E2 protein that interacts 
with DNA directly is between about amino acids 333 and 
344, and that dimerization activity is encoded by a 

15 complex domain that spans the segment between about 
amino acids 360 and 402. Accordingly, we believe that 
the DNA binding recognition and the dimerization 
functions of £2 proteins are separable and mediated by 
two novel motifs. A short basic region, unlikely to be 

20 helical, is required for DNA binding but not 

dimerization. While a conserved central cysteine in 
this motif is not necessary, this represents a critical 
position for modifying the DNA binding capacity of E2 
protein, since replacement of cysteine with large amino 

25 acids adds efficiently abrogated DNA binding. The 
dimerization motif includes a critical tryptophan at 
position 360 in BPV-i E2 polypeptide. . 

It should be noted that DNA binding capacity 
was lost by a mutant in every instance in which 

30 dimerization capacity was lost. These results indicate 
that dimerization is a prerequisite for DNA binding. 
However, some mutant gene products that lost the 
capacity for DNA binding retained the capacity for 
dimerization. In those mutant gene products, which 

35 represent the novel class of E2 trens-activation 
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r pr 8a rs f this invention , the dimerization function 
was separated from th DNA binding function. Mutants 
337L, 339M, 340F, 340R, 340Y and 344L ar included in 

this group. These mutants that dimerized without 
5 binding to E2 DNA binding sites were further tested for 
capacity to repress E2 -dependent trans -act i va t i on in 
cultured animal cells. 

Repression Of E2 -Deoendent trans-activation 
10 In Cultured Animal Cells 

We next assayed the capacity of E2 mutants 
that dimerized without binding to E2 DMA binding sit s 
to repress E2 -dependent trans -act i vat ion in cultured 
mammalian cells. In this assay, an E2 dependent 

15 reporter plasmid (characterized by either the gene 

encoding chloramphenicol acetyltransf erase (CAT) or the 
gene encoding human growth hormone (hGH) driven by a 
truncated SV40 promoter having three upstream E2 
binding sites) , the full-length wild type E2 trans - 

20 activator protein driven by the actin promoter and the 
mutant clones in an E2 -repressor format starting at 
nucleotide 3089 of BPV, (i.e., amino acid 160), also 
driven by the actin promoter, were simultaneously 
introduced into cultured mouse embryo fibroblast cells 

25 by well-established electroporation techniques • The E2 
repressor DNA was present at a four-fold excess. Th 
E2 repressor assay was performed at an E2 trans - 
activator level below the saturation level, sine high 
amounts of E2 repress transcription, perhaps by 

30 "squelching". Transfections which resulted in the 

greatest E2 inductions also gave the best E2 repression 
(see Table II, infra ) and the greatest reproducibility. 

The reporter plasmid was constructed so that 
expr ssion of the r porter gene was highly dependent on 
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E2 trans- activation (i.e., it comprised one, two or 
four E2 DMA binding sit s appropriately plac d relativ 
to the promoter and reporter gene coding sequence) . 
The choice of a reporter gene is largely a matter of 
5 convenience. In general, any gene whose expression, 
either directly or indirectly, results in a product 
that can be measured with reasonable accuracy and 
reliability can be used as a reporter gene. Preferred 
reporter genes for the assay of E2 £rajis-activation in 
10 the cultured mammalian cells according to this 
invention are the gene encoding hGH and the gene 

encoding CAT. 

We constructed the hGH reporter plasmid, 
pXB332hGH, in a two-step process (Figures 5 and 6) . 

15 First, we constructed pXB332 by inserting the E2- 

dependent promoter (Sall-Hindlll fragment) from the E2- 
dependent reporter plasmid pC515-9 (Hawley-Nelson 
et al., supra . (1988)) into plasmid vector pXBlOO (see 
Figures 5 and 7) that had been previously cleaved with 

20 Xhol and Hindlll, to form plasmid pXB332. We then 

inserted the hGH gene as a Hindlll-EcoRI fragment from 
pOGH (Nichols institute, San Juan Capistrano, 
California) into pXB332 that had been cleaved with 
Hindlll and EcoRI to create pXB322hGH (Figure 6). We 

25 constructed the CAT reporter plasmid according to a 
published method (P. Hawley-Nelson, supra) . 

The E2 trans -activator plasmid vector 
comprised a full-length native BPV1 E2 gene from pCO- 
E2 (Hawley-Nelson et al. , supra), operatively linked to 

30 control sequences that rendered its expression 

essentially constitutive. Thus, the E2 trans-activator 
plasmid directed synthesis of E2 protein for trans- 
activation of the reporter gene. In order to ensure 
that repressor effects were observable, however, the 

35 promoter controlling expression of the full-length E2 
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gene was not s active as to yield saturating levels of 
full-length E2 protein in the transfected mammalian 
cells of the repressor assay system. If the E2 trans - 
activator gene is over expressed, repression data ar 
5 unreliable. In a preferred embodiment of this 

invention, a chicken 6-actin promoter is employed for 
expression of the E2 trans-activator gene. 

We expressed the native E2 coding sequences 
and mutant E2 coding sequences from the chicken fl-actin 

10 promoter (T.A. Kost et al., "The Nucleotide Sequence of 
the Chick Cytoplasmic B-Actin Gene 11 , Nucl. Acids 
Res. , 11, pp. 8287-8301 (1983); A. Seiler-Tuyns et al., 
"Expression and Regulation of Chicken Actin Genes 
Introduced into House Myogenic and Hon-Myogenic Cells 11 , 

15 Proc. Natl. Acad, Sci. USA . 81, pp. 2980-84 (1984)) in 

animal cells using vector pXBlOl (See Figure 7). 

We constructed plasmid pXBlOl in a 2 -step 

process (Figure 7) • Two oligonucleotides were 

synthesized and annealed to form a poly linker: 

20 5 1 CTCGAGAAGCTTGACGGATCCG 3 1 (SEQ ID NO: 18) 

3 1 TGCAGAGCTCTTCGAACTGCCTAGGCTTAA 5» (SEQ ID NO: 19) 

This poly linker contained Xhol, Hindlll, and BamHI 
restriction sites internally, with an Aatll compatible 

* 

overhang at the 5' end and an EcoRI compatible end at 
25 the 3' end. We then cleaved plasmid pBG312 (R.L. Cate 
et al., "Isolation of the Bovine and Human Genes for 
Mullerian Inhibiting Substance and Expression of th 
Human Gene in Animal Cells", Cell , 45, pp. 685-98 

* 

(1986)) with Aatll and EcoRI to release a fragment 
30 containing the Ad-2 promoter of pBG312. We inserted 
the polylinker into the cleaved pBG312 in place of the 
Ad-2 promoter to form the promoterless vector pXBlOO, 
We then cleaved pXBlOO with Xhol and BamHI (exploiting 
the polylinker sites) and inserted the chicken B-actin 
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promoter from pBAct-1 (Kost et al., SUEta) as a 280 bp 
XhoI-BamHI fragm nt, to form pXBlOl. 

For expression of native full-length E2 
protein, ve inserted a 1866 bp BamHI fragment from pCO- 
5 E2 (Hawley-Nelson et al. supra ) into the BamHI site of 
pXBlOl, to form plasmid pXB323 (Figure 8) . 

A polypeptide consisting of the o terminal 
249 amino acids of the native E2 protein lacks trans - 
activation capacity and can repress tians-activation by 

10 native full-length E2 protein (P.F. Lambert et al., 
supra ) . For expression of the native BPV1 E2 
repressor, the 1362 bp NcoI-BamHI fragment of pXB323 
(encoding the C-terminal 249 amino acids of the BPV1 E2 
protein and starting with a methionine at the Ncol 

15 site), was inserted into the BamHI site of pXBlOl, to 
form pXB3l4 (Figure 9) . Synthetic oligonucleotides 
were also inserted in order to join the Ncol cohesive 
end of the 1362 bp fragment to the BamHI cohesive end 
of pXBlOl. Those synthetic oligonucleotides are shown 

20 below: 

5 9 GATCCPTTGCCGCCAC 3 1 (SEQ ID NO: 20) 

3 1 GAAACGGCGGTGGTAC 5 f (SEQ ID NO: 21) 

In order to test E2 homologues for their 
capacity to repress E2 trans -activation , mutant forms 

25 of the E2 DNA binding domain (from clones selected in 
the phenotype screening described in Examples 2 and 3 
above) were inserted as KpnI-BstXl fragments into Kpnl- 
, BstXI-cleaved pXB314 (see Figure 10) . In this way, th 
C- terminal 126 amino acids of the polypeptide 

30 consisting of the C-terminal 249 amino acids of the 
native E2 polypeptide were replaced with the 
corresponding mutant sequences to form each of the 
mutant E2 repressor plasmids assayed, including 

PEC337L, pE339K, PEC340F, pEC340R, pEC340Y and pEC344L* 
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All transf ections and assays were p rformed 
at subsaturating 1 vels of the E2 trana-activator. 
This was done by using the moderately weak act in 
promoter to drive expression of the protein. Unless 
5 otherwise indicated, all steps were carried out at room 

temperature • 

The transfections were carried out on the 
mouse embryo fibroblast cell line Balb/c 3T3, clone A31 
(S.A. Aaronson and G.J. Todaro, "Development of 3T3- 

10 Like Lines from Balb/c Mouse Embryo Cultures: 
Transformation Susceptibility to SV40", J. Cell 
Ph ysiol, , 72, pp. 141-48 (1968) ), obtained from the 
American Type Culture Collection (ATCC accession 
no. ATCC CCL163) . The 3T3 cell culture medium was 

15 DulBecco's minimal essential medium (Gibco, Grand 
Island, NY) , with 10% donor calf serum (Hazelton, 
Lenexa, KS) and 4 mM glutamine (Whittaker, 
Walkersville, MD) • We maintained the 3T3 cell cultures 
in an incubator at 37 °C, in an atmosphere containing 

20 5.5% C0 2 . Cells were grown in 100 mm culture dishes 

(Corning, Corning, NY, cat. no. 25020) • The cells were 
passaged by washing with phosphate-buffered saline 
solution (Gibco) and treatment with trypsin (Gibco) , 
(to remove adhering cells from the culture vessels), 

25 followed by addition of fresh culture medium and 
dilution of cultures into vessels containing fresh 
culture medium. 

Transient electroporations were carried out 
to measure the repression activity of the mutants. We 

30 employed a commercially available electroporation 

device (Gene Pulser™, BioRad, Richmond, CA) and used an 
electroporation technique similar to that of G. Chu 
et al., "Electroporation For the Efficient Transfection 
of Mammalian Cells With DNA", Nucl. Acids Res, , 15, pp, 

35 1311-26 (1987) to introduce plasmids into the 3T3 
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c lis. In ach electroporation, w us d a total of 400 
/jg of DNA. Of that 400 /*g, 20 M9 vas r port r plasmid, 
20 nq vas tyans -activator plasmid and 80 nq was 
repressor plasmid. The remainder of the 400 M9 vas 
5 made up with herring sperm DNA (Boehringer Hannh im, 
Indianapolis, IN), that had been sonicated to fragm nts 
of about 300 to 2000 bp in size. To a solution of the 
DNA (0.4 ml) we added NaCl to a final concentrati n of 
0.1 M and then we precipitated the DNA with 2.5 volumes 
10 of ethanol. We pelleted the precipitated DNA in an 
Eppendorf centrifuge, air-dried it in a tissue culture 
hood and resuspended the DNA in 0.8 ml of 20 mM H pes 
(pH 7.05), 137 mM NaCl, 5 mM KCl, 0.7 mM Na 2 HP0 4 and 
6 mM dextrose, ( n l x HeBS") . We allowed the DNA to 
15 resuspend in the 1 x HeBS from about 3 to 24 hours. 

For each electroporation, we removed about 
5 x 10 6 3T3 cells (that had been passaged or fed on the 
previous day) from a culture vessel by trypsin 
treatment and pelleted the cells by centrifugation at 
20 1000 rpm in a Damon/IEC HN-SII rotor (about 250 x g) 
for 4 min. After removal of the medium above the 
pelleted cells by aspiration, we resuspended the cells 
in the DNA plus 1 x HeBs (see above). We then 
transferred the solution containing the DNA and cells 
25 to an electroporation cuvette. We immediately 

discharged a 960 /iFD capacitor, to yield about 240 v 
for about 10 msec. We left the cells in the cuvette 
for about 8 min. and then transferred them to a test 
tube containing 10 ml of culture medium and pelleted as 
30 above. We then aspirated the medium, resuspended the 
cells in 10 ml culture medium, seeded them into a 10 cm 
plate and returned the plate to the cell culture 
incubator . 

When using the hGH reporter gene, we 
35 harvested the culture medium to assay for secreted hGH 
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after 48 to 72 hrs. Alternatively, when using the CAT 
reporter, w harvest d the electrop rated cells after 
48 to 72 hours. We controlled for cell number by 
counting cells, if using the hGH assay, and by 
5 measuring total protein concentration in the extracts, 
if using the CAT assay. 

In order to quantitate expression of the 
reporter gene, ve performed hGH assays according to the 

method of Selden, Protocols in Molecular Biology, 

10 Greene Publishing Associates, New York, pp. 9. 7,1 - 
9^7.2 (1987). For hGH assays, we used a commercially 
available kit (Allegro 1- Human Growth Hormone transient 
gene expression system kit, Nichols Institute, San Juan 
Capistrano, CA) . We performed CAT assays according to 

15 the method of CM. Gorman et al. "Recombinant Genomes 
Which Express Chloramphenicol Acetyltransferase in 
Mammalian Cells", Mol. Cell Biol.. 2, pp. 1044-51 
(1982)). Positive and negative controls were employed, 
as appropriate. Such controls included transfection of 

20 a reporter plasmid in the absence of a £EailS-activator 
plasmid (reporter background), transfection of a 
reporter plasmid in the presence of a jfc£ans-activator 
plasmid and absence of trans - act i va t i oh repressor 
plasmid (unrepressed trans -activation) and transfection 

25 of a reporter plasmid in the presence of a trans - 

activator plasmid and a plasmid for expression of the 
BPV1 E2 native repressor (i.e., the C-terminal 249 
amino acids of the native E2 polypeptide). 

In evaluating the E2 homologues, we utilized 

30 data on reporter gene background level, E2 induction 
level and repression produced by the BPV1 E2 native 
repressor. Reporter gene background activity was 
calculated as reporter activity in the absence of E2 
trans -activator protein. E2 induction level was 

35 calculated as reporter activity in the presence of E2 
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trans -activator protein divided by reporter background 
activity. Repression was calculated according t the 
following formula: 

f f ACT With B21 - BKffiJ - "ACT with E2 and REP) - BKG1 
5 (ACT with E2) - BKG 

■ • 

Wherein: 

ACT is activity of reporter 

BKG is background activity of reporter 

REP is repressor. 

10 Table I (below) provides an example of 

results calculated with the above formula.* 



TABLE I 

Tpansfection hGH (ua /ml) Induction Repression 

pXB332hGH 

15 (reporter) 0.1 

pXB322hGH + 
pXB323 

(reporter £ 10. 0 100-fold 

trans -activator ) 

20 pXB332hGH + 
PXB323 + 
pXB314 

(reporter & l.o 10-fold 90.9% 

trans -activator & 
25 native repressor) 

pXB332hGH + pXB323 
+ pXB314.360S 

(reporter & 7.0 70-fold 30.3% 

trans -activator & 
30 homologous repressor) 



* In this example, repression by the homologous 
repressor (360S) could also be expressed as 33% of the 
repression exhibited by the native repressor. 

35 Table I illustrates a convenient and valid means of 
comparing repressor activities, with appropriate 
controls taken into account. 
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Tabl II shows raw data and calculated values 
for several E2 repr ssor assays carried out as 
described above. In Table II, CPH represents sample 
radioactivity counts per minute; CPM - BKGD represents 
5 sample counts per minute minus background counts p r 
minute; % Repression represents the value calculat d 
for the mutant, from the formula above, multiplied by 
100; % of Native Repression represents the % Repression 
value for the mutant divided by the repression value 

10 for the native repressor, calculated from the formula 
above; C represents the hGH or CAT reporter plasmid; 
323 represents the trans -activator plasmid, pXB323; 314 
represents the native repressor plasmid, pXB314; and 
the numbers followed by an upper case letter or an 

15 asterisk refer to the mutant polypeptide sequence being 
tested (see Figure 2). 

TABLE II 

* 

E2 REPRESSION ASSAYS 

% OF NATIVE 

20 SAMPLE CPM CPM - BKGD % REPRESSION REPRESSION 

CAT ASSAY #3 

C 234 
C + 323 3,278 3,044 

C + 314 + 323 378 144 95.3 

25 C + 340S + 323 572.5 338.5 88.9 93.3 
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% OF NATIVE 

SAtKlSt C£M CPM : BKCD % REPRESSION REPRESSION 

CAT ASSAY #4 





C 


161 


.5 








5 


C + 323 


9,751 




9,589.5 


— 


— 




C + 314 + 323 


509. 


.5 


348 


96.4 


• <m m 




C + 337L + 323 


969. 


5 


808 


91.6 


95.0 




C + 340R + 323 


538 




376.5 


96.1 


99.7 




C + 360S + 323 


9,096 




8,934.5 


6.8 


7.1 


10 


C + 402* + 323 


5,868. 


5 


5,707 


40.5 


42.0 




CAT ASSAY #5 














C 


292 












C + 323 


5,914 




5,622 




m> m 




C + 314 + 323 


398 




106 


98.1 


4» tm 


15 


C + 339M + 323 


1,738 




1,446 


74.3 


75.7 




C + 344L + 323 


579 




287 


94.9 


96.7 




C + 360S + 323 


2,656 




2,364 


57.9 


59.1 




C + 402* + 323 


3,102 




2,810 


50.0 


51.0 
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SAMPLE CPM 
CAT ASSAY #6 

C 122.5 

5 C + 323 2,867 

C + 314 + 323 218 

C + 337L + 323 418.5 

C + 339M + 323 725.5 

C + 3701 + 323 313.5 

10 C + 3SLI + 323 1,800 

C + 3991 + 323 456 

C + 366Y/ + 323 2,705.5 
376L 

CAT ASSAY #EP6 

15 C 377 

C + 323 3,906 

C + 314 + 323 1,053 

C + 316Y + 323 3,607 

C + 340Y + 323 756 

20 C + 344L + 323 1,429 

C + 3701 + 323 3,214 

C + 3SLI + 323 2,524 
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% OF NATIVE 

CPM - BKGD % REPRESSION REPRESSION 



2,744.5 

95.5 96.5 

296 89.2 92.4 

603 78.0 80.9 

191 93.0 96.4 

1,677.5 38.9 40.3 

333.5 87.8 91.0 

2,583 5.9 6.1 



3,529 

676 80.8 

3,230 8.5 10.5 

379 89.3 110.5 

1,052 70.2 86.9 

2,837 19.6 24.3 

2,147 39.2 48.5 
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% OF NATIVE 

SAMPLE CPM Cm - BKCD % REPRESSION REPRESSION 

hGH ASSAY #2 





c 


160 

4«ka 








5 


C + 323 


9,937 


9,777 


m m • 


— 




C + 314 + 323 


415 


255 


97.4 


--- 




C + 337L + 323 


862 


702 


92.8 


95.3 




C + 340R + 323 


452 


292 


97.0 


99.6 




C + 344L + 323 


1,680 


1 , 520 


84.4 

• 


86.7 


10 


C + 360S + 323 


7,925 


7,765 


20.6 


21.1 




C + 3701 + 323 


8,175 


8,015 


18.0 


18.5 




nun Aa^AI wj 








- 




C 


303 










C + 323 


14,522 


14,218 




a a» • 


15 


C + 314 + 323 


2,237 


1.934 


86.4 






C + 317ST0P + 323 


12,830 


12,527 


11.9 


13.8 




C + 333V + 323 


19,853 


19,550 


0 


0 




C + 339M + 323 


16,891 


16,588 


0 


0 




C + 340F + 323 


2.629 


2,326 


83.6 


96.8 
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% OF NATIVE 

SAMPLE CPH <?PM - BKGD $ REPRESSION BEEEESSIflH 

hGH ASSAY #5 





c 


239 








5 


C + 323 


2.455 


2,216 





... 




C + 314 + 323 


473 


234 


89.4 






C + 316Y + 323 


O 0 07 
2,20/ 




7 A 


ft s 

O.J 




M ****** »» ** *% A 

C + 333V + 323 


4,275 


4,036 


0 


ft 




C + 340G + 323 


486 


247 


88 .8 


Aft o 

99 . 3 


10 


C + 408* + 323 


1,219 


980 


55.8 


62.4 




C + 411* + 323 


4,756 


4,517 


0 


0 




hGH ASSAY #6 








♦ 




C 


169 




™ • ™ 


m a — 




C + 323 


J , 2U / 


C MO 

j , UJo 








C + 314 + 323 


OQO 

2o2 


111 


Q7 ft 






L + J*HJI + 32J 




i on 


07 6 






C + 386W + 323 


1,147 


978 


80.6 


82.4 




C + 408* + 323 


2.024 


1,855 


63.2 


64.6 




hffl ASSAY #7 










20 


C 


110 








* 


C + 323 


1,692 


1,582 


m m m 






C + 3701 + 323 


335 


225 


85.8 


1 
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% OF NATIVE 

SAMPLE SIX. CPM - BKGD % REPRESSION REPRESSION 

hCH ASSAY # EP5 





c 










5 


C + 323 


1 1 OAT 
11, ZU/ 




• 






C + 314 + 323 


lot 


Die 


OA. A 






C + 317STOP + 323 


10 , 943 


10,733 


2.4 


2.5 




C + 339M + 323 


2.376 


2,166 


80.3 


84.7 




C + 340F + 323 


1,475 


1,265 


88.5 


93.4 


10 


C + 340G + 323 


807 


597 


94.6 


99.7 




C + 340R + 323 


763 


553 


95.0 


100.2 




C + 340S + 323 


1,290 


1.080 


90.2 


95.1 




C + 366Y/ + 323 
376L 


8 , 314 


8,104 


26.3 


27.7 


15 


C + 386W + 323 


2,151 


1,94 


182.3 


86.9 




C + 3991 + 323 


1,750 


1,540 


86.0 


90.7 




C + 411L + 323 


10 , 206 


9,996 


9.1 


9.6 



- Ratio of Repressor to trans - activator - 4:1 



20 * Dashes indicate not applicable. 

- Question mark indicates that % of Native Repression could not be 
calculated, because no control with the native repressor plasmld 
(C + 314 + 323) was done in that assay. 
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Repression by a four-fold excess (by weight) 
of the native repress r was never below 80% in any 
assay. Reproducibility of the assay r suits shown in 
Table II was generally high. Mutant polypeptide 339M 
5 did not repress at all in one assay but gave good 

repression in three other assays, when a different DNA 
preparation was used. 

He tested each E2 homologue between two and 
four times for its ability to repress E2-dependent 

10 trans -activation in mammalian cells. A compilation of 
trans -activation repression assay results is shown in 
Table III below. The repression activity is also 
summarized in Figure 2. It is clear from these assays 
that the C-terminal portion of E2 protein need not be' 

15 able to bind DNA in order to repress. 
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TABLE III 



ffMMBfllT of Mutant E2 Repr essor A ctivity 1 



10 



15 



20 



25 



Mutant 


Repression as % 
Decrease in Trans - 
Activation Bv E2 


Repression As 
% of Repression 

Bv WT Repressor 2 


±1 


316Y 


8.0 




9.5 


— 


3 17 STOP 


7.1 




8.1 


_ 


333V 


0 3 








337L 


90.4 




93.7 


+ 

■ 


339M 


76.2 


• 


78.3 


+ 


340F 


86.1 




95. 1 

» *mf • all. 


+ 

■ 


3406 


91.7 




99.5 


• 


340R 


95.6 




99.9 




340S 


89.6 




94.2 

^ • mm 


+ 


340Y 


93.5 




105.1 


+ 


344L 


82.6 




91.8 


+ 


3 60S 


32.3 




33-1 




366Y/376L 


16.1 




16.9 




3701 


66.1 




70.9 




374S/375L/ 


39.0 




44.4 




3911 (3SLI) 








386W 


81.4 




84.6 


+ 


3991 


86.9 




90.8 


+ 


402* 


45.2 




46.6 




(4 AA insert) 








408* 


59.5 




63.5 




(11 AA insert) 






411* 


4.5 




4.8 





30 1 All values represent the average of two to four 
assays • 

2 Each mutant repressor was compared to the native 
repressor in the same assay. 

3 H 0 n indicates that activation was slightly greater 
35 in the presence of this mutant than in the control 

having no repressor present. 



Table III shows a compilation of the results 
of all mutants which have been tested for repression. 
We arbitrarily defined a repressor as a protein which 
40 repressed by at least 70% at a four- fold excess. 



4 • 

WO 92/12728 



PCT/US92/00652 



- 63 - 

Mutants 337L, 340F, 340R, 340Y and 344L, all of which 
could din rize but did not bind DNA, repressed 
essentially as v 11 as th native repressor. 
Dimerization-defective mutants 3 60S, 3SLI and 402* did 
5 not repress. Hutants 316Y, 411*, and 366Y/376L did not 
repress, despite the fact that they were capable of 
forming dimers in vitro . However, mutant polypeptide 
316Y appeared to be very unstable, suggesting that 
mutant polypeptides 316Y, 411* and 366Y/376L may have 

10 failed to repress as a result of their presence in the 
cells at very low concentrations, due to instability. 

Thus, it appears that DNA binding is not 
necessary for repression of E2 trans -activation . 
Instead, a mechanism other than competition for DNA 

15 binding sites operates. Mutants which cannot 

dimerize — or which do so very weakly — 317STOP, 
333V, 360S, 3SLI and 402* (4 AA insert) either repress 
poorly or not at all. We believe that the repress rs 
of this invention act through dimer formation. More 

20 specifically, we believe that the repressors of this 
invention form heterodimers with the full-length E2 
protein and thereby sequester it in an inactive form. 

Repression of HPV E2-Denende nt trans-Activation 
25 in Cultured Animal Cells 

There is a high level of homology between the 

E2 binding domains of BPV1 and human papillomaviruses 

4 

( W HPV M ). Accordingly, we tested the ability of 
homologous HPV E2 sequences to function as E2 trans - 
30 activation repressors in a manner similar to that 
observed with the BPV E2 repressors described in 
Example 6, supra . To do this, we constructed v ctors 
to express full-length native (trans-activating) HPV E2 
protein and putative HPV E2 repressors. 
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For expr ssion of a full-length HPV E2 gene, 
we construct d plasmid pAHE2 (Fig. 11) . Plasmid pAHE2 
contains the E2 gene from HPV strain 16 operatively 
linked to the adenovirus major late promoter augmented 
5 by the SV40 enhancer upstream of the promoter. He 

isolated the HPV E2 gene from plasmid pHPV16 (the full- 
length HPV16 genome cloned into pBR322), described in 
M. Durst et al., "A Papillomavirus DNA From Cervical 
Carcinoma And Its Prevalence In Cancer Biopsy Samples 
10 From Different Geographic Regions", Proc. Natl. Acad. 

Sci. USA , 80, pp. 3812-15 (1983) as a Tthllll-Asel 
fragment (Tthllll cleaves at nucleotide 2711, and Asel 
cleaves at nucleotide 3929 in the HPV16 genome) • We 
blunted the ends of the Tthllll-Asel fragment in a DNA 

15 polymerase I Klenow reaction and ligated BamHI linkers 
(New England Biblabs, cat! no. 1021) . We insert d this 
linker-bearing fragment into BamHI -cleaved plasmid 
pBG331, to create plasmid pAHE2 . 

Plasmid pB6331 is the same as pBG312 (Example 

20 6; Fig. 7) except that it lacks the BamHI site 

downstream of the SV40 polyadenylation signal, making 
the BamHI site between the promoter and the SV40 intron 
unique. We removed the unwanted BamHI site by partial 
BamHI digestion of pBG312, gel purification of the 

25 linearized plasmid, blunt end formation by DNA 
polymerase I Klenow treatment, self-ligation and 
screening for plasmids with the desired deletion of the 
BamHI site (tig. 11) . 

To provide a positive control (i.e., an E2 

30 repressor) for comparisons in the HPV E2 repression 
assays, we cloned the DNA sequence ( M E2R fragment n ) 
encoding the 249-amino acid BPV E2 repressor into 
plasmid pBG331, to create pBG331E2R. To construct 
pBG331E2R, we removed the E2R fragment from pXB323 

35 (Example 6; Fig. 8) by BamHI digestion, and then 
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inserted that fragment into BamHI -cleaved pBG33l. We 
used the sam procedure to construct plasmid 
pBG331E2RN, a n gativ control, t express th 
diner ization-def icient BPV E2 sequence designat d 360N. 
5 Since it expresses a dimerizat ion-deficient E2 

polypeptide which does not repress E2 trans -activation f 
we incorporated pB6331E2RN into our repression assays 
to serve as a negative control. Plasmid pBG331 E2RN 
expresses a BPV1 E2 repressor with a tryptophan to 

10 asparagine mutation at amino acid residue 360. This 
analogue is dimerization defective. It is similar to 
analogue 360S, described in Example 5, supra , but 
exhibits lower dimerization activity. 

Based on our comparison of the BPVl and HPV16 

15 sequences, we expected the C-terminal 83 amino acid 
residues of the HPV E2 protein to exhibit dimerization 
and DNA binding activity, and thus to repress E2 trans - 
activation. We therefore constructed plasmid pHE2-85 
from expression plasmid pBG331 (Figs. 12 and 13) by 

20 inserting a 260 base pair NcoI-BamHI fragment ("E2-85 M ) 
containing methionine and alanine codons immediately 
followed by codons for the C-terminal 83 amino acids of 
HP VI 6 E2. Similarly, we constructed plasmid pHE2-l23 
(Figs. 14 and 15) by inserting into pBG331 a 374 base 

25 pair fragment ( W E2-123 W ) containing methionine and 

valine codons immediately followed by codons for the C- 
terminal 121 amino acids of HPV 16 E2. 

For construction of plasmids pHE2-85 and 
pHE2-123, we produced the necessary DNA fragments by 

30 standard polymerase chain reaction ("PCR") techniques 
with pHPV16 as the template. PCR chemicals and 
equipment are commercially available. For a gen ral 
discussion of PCR techniques, see Chapter 14 of 
Sambrook et al., Molecular Cloning - A Laboratory 

35 Manual, 2nd Ed. . Cold Spring Harbor Press (1989). The 
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nucleotid s qu nc of EA57, the PCR oligonucl otide 
prim r for th 5 1 nd of th 260 bas pair E2-85 
fragment, is set forth in the Sequence Listing under 
SEQ ID NO: 28. The nucleotide sequence of EA52, th PCR 
5 oligonucleotide primer for the 5' end of the 374 base 
pair E2-123 fragment, is set forth in the Sequenc 
Listing under SEQ ID NO: 29. The nucleotide sequence of 
EA54, the PGR oligonucleotide primer used for th 3 1 
end of both fragments, is set forth in the Sequence 
10 Listing under SEQ ID NO: 30. We digested the PCR 

products with Ncol and BamHI and cloned the resulting 
fragments into Ncol /BamHI -digested expression plasmid 
pET8c (Studier et al, supra ) , to create plasmids pET8c- 
85 and pET8c-123. As depicted in Figures 13 and 15 , we 
15 cleaved pET8c-85 and pET8c-123 with Ncol and BamHI and 
transferred the fragments E2-85 and E2-123 into the 
unique BamHI site of plasmid pB6331, with the use of 
the BamHI -Ncol linker described in Example 6 (SEQ ID 
NO: 20 and SEQ ID NO: 21) , to create plasmids pHE2-85 
20 and pHE2-123. 

We performed repression assays by transient 
transfections of mouse fibroblast 3T3 cells, using 
reporter plasmid pXB332hGH, as described in Example 6, 
supra . The HPV E2 repression assay results are shown 
25 in Table IV, below. In Table IV, C represents the 
reporter plasmid, pXB332hGH. The other plasmids co- 
transfected in the repression assays have the same 
designation in Table IV as in the foregoing discussion. 
CPM represents sample radioactivity counts per minute; 
30 CPM - BKGD represents sample counts per minute minus 
background (reporter plasmid alone) counts per minute; 
INDUCTION represents counts per minute for the 
combination of reporter plasmid with the trans - 
activator plasmid (pAHE2) , divided by counts per minute 
35 for the r porter plasmid alone; % REPRESSION represents 
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the valu calculated fr m th formula in Example 6, 
multiplied by 100. 

TABLE IV 



HPV E2 REPRESSION ASSAYS 



5 hCH mag HE2.4 



15 C 



SAMPLE 




CPM-BKGD 


INPPCTIW 


| REPRESSION 


C 


461 


m m » 


• m m 





C + pAHE2 


3,726 


3,265 


8.1 





C -1- pAHE2 + pBG331E2R 


433 


0 


0 


100.0 


C + pAHE2 + pBG331E2RN 


3,815 


3,354 


8.3 


0 


C + pAHE2 + pHE2-85 


879 


418 


1.9 


87.2 


C + pAHE2 + pHE2-123 


422 


0 


0 


100.0 

* 


hGH assav HE2.5 














end -Kvnn 




* Harness ium 


c 


513 








C + pAHE2 


2,760 


2.247 


5.4 




C + pAHE2 -1- pBG331E2R 


549 


36 


1.1 


98.4 


C + pAHE2 + pBG331E2RN 


2.405 


1,892 


4.7 


15.8 


C + pAHE2 + pHE2-85 


1,016 


503 


2.0 


77.6 


C + pAHE2 + pHE2-123 


488 


0 


0 


100.0 


• 

- Ratio of Repressor to 


trans 


-activator 


- 4:1 





- Dashes indicate not applicable. 



We observed a 5 to 8-fold level of trans - 
activation by full-length HP VI 6 E2 protein. This was 
approximat ly 10-fold lower than the lev 1 of trans - 
activation shown by full-length BPV1 E2 protein. To 
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compensat for the lover 1 v 1 of £can§-activation, we 
used expr ssion plasmid pBG331 for the HPV repr ss ion 
assay plasmid constructs. Plasmid pBG33l has a 
stronger promoter and therefore presumably expresses 
5 higher levels of E2 protein than pXBlOl, which we used 
for plasmid constructs in the earlier BPV E2 repr ssion, 
assays described in Example 6. 

As expected, the 249 amino acid BPV1 E2 
repressor (positive control) completely repressed 
10 trans -activation by the full-length HPV16 E2 protein , 
and the dimeriz at ion-defective BPV1E2RN analogue 
(negative control) shoved negligible repression. Also 
as expected, the C-terminal 85 and 123 amino acids of 
HPV16 E2 protein functioned as repressors. The 85 
15 amino acid HPV16 E2 protein fragment gave 87.2 and 
77*6% repression, vhile the 123 amino acid HPV16 E2 
protein fragment gave 100% repression in both assays. 
We believe that these HPV E2 repressors would maintain 
their repression activity upon introduction of 
20 mutations that destroy DNA binding without inhibiting 
dimerization. Examples of such mutations are cysteine 
to arginine at position 300 in HVP16 E2 (which 
corresponds to position 340 in BPV1 E2), and arginine 
to leucine at position 304 (which corresponds to 
25 position 344 in BPV1 E2) . 

EXAMPLE 8 

Cellular Uptake of E2 Repressor Proteins 

The E2 repressor proteins are not taken into 
cells at detectable levels. Thus, to deliver an E2 
30 repressor protein into cells , we fused it to tat, a 
protein which naturally enters cells. Tat is a small, 
basic protein encoded by human immunodeficiency virus 
type I ("HIV-I"). It binds to the cell surface and 
gains entry by non-specific endocytosis in all cell 
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types so far t sted. Uptake of mor than 10 million 
tat mol cules per cell hav be n observ d. E2 
repressors can be link d to tat by expression of a 
recombinant fusion protein in transformed host c lis or 
5 by chemical cross-linking of the two proteins. W have 
produced a tat-E2 repressor fusion protein in E. coll 
by recombinant DNA techniques. 

From plasmid pXB314 (Example 6; Fig. 9) ve 
isolated the Ncol-Spel DNA fragment encoding the 249 

10 amino acid BPV1 E2 repressor, E2R. (Ncol cleaves at 
nucleotide 296, and Spel cleaves at nucleotide 1118 of 
pXB3l4.) We blunted the ends of this fragment by DNA 
polymerase I Klenov treatment and added the Bglll 
linker (New England Biolabs, cat. no. 1090) described 

15 in Example 6 ( supra ) and Fig. 9 (SEQ ID NO: 20 and SEQ 
ID NO; 21) . We inserted this linker-bearing fragment 
into BamHI-cleaved (complete digestion) plasmid pTAT72. 
Plasmid pTAT72 is described in A. D. Frankel and 
C. 0. Pabo, "Cellular Uptake Of The Tat Protein From 

20 Human Immunodeficiency Virus", Cell . 55 , pp. 1189-94 
(1988). In plasmid pTAT72 there is a BamHI cleavage 
site within the tat coding region, near its 3 1 end, and 
a second BamHI cleavage site slightly downstream of th 
tat gene. The Bglll linker joined the tat and E2 

25 coding sequences in frame to encode a fusion of the 
first 62 amino acids of tat protein followed by a 
serine residue and the last 249 amino acids of BPV1 E2 
protein. . We designated this bacterial expression 
plasmid pFTESOl (Fig. 16) . We expressed the tat-E2R 

30 fusion protein in E. coli strain BL21 (DE3) as 

described in Studier et al., (supra). We purified the 
tat-E2R fusion protein from the insoluble fraction of 
E. coli according to the following procedure. 

W pelleted the bacteria and resuspended them 

35 in ten packed cell volumes of 25 mM Tris-HCl (pH 7.5), 
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1 bM EDTA. 10 DM DTT, 1 mM PMSF, and lysed with 2 
passages through a French pr ss. We pell t d th 
membrane fraction by centr if ligation at 10,000 rpm for 
30 minutes in a Sorval SS-34 rotor, and then 
5 resuspended the membrane fraction in 6 M urea. We 
added solid guanidine-HCl to a final concentration of 
6 M, and DTT to a concentration of 10 mM. After 30 
minutes at 37°C, ve clarified the solution by 
centrifugation at 10,000 rpm for 30 min. in a Sorval 
10 SS-34 rotor. We then loaded the sample onto an A. 5 
agarose gel filtration column in 6 H guanidine-HCl, 50 
mM sodium phosphate (pH 5.4), 10 mM DTT and collected 
tat-E2R-containing fractions from the gel filtration 
column, according to the appearance of a band of the 
15 appropriate molecular size on Coomassie-stained SDS 

polyacrylamide electrophoresis gels. We loaded th gel 
filtration-purif ied sample onto a C 18 reverse phase HPLC 
column and eluted with a gradient of 0-75% acetonitrile 
in 0.1% trif luoroacetic acid. We collected the tat- 
20 E2R fusion protein in a single peak with an apparent 
molecular weight of 40,000 Da. 

We assayed cellular uptake of the tat-E2R 
fusion protein by indirect immunofluorescence in mouse 
fibroblast 3T3 cells. In the indirect 
25 immunofluorescence procedure our primary antibody was 
either a rabbit polyclonal antibody against BPVl E2, 
generated by injection of the purified C-terminal 85 
, amino acids of E2, or a rabbit polyclonal antibody 
against tat protein, generated by injection of the 
30 purified 72 amino acid tat protein. We purified each 
of these 2 types of antibodies on an affinity column 
bearing the protein antigen against which the antibody 
was raised. Our secondary antibody was a rhodamine- 
conjugated goat anti-rabbit IgG (Cappel cat. no. 2212- 
35 0081). 
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W seeded the 3T3 c lis into 4 -chamber tissue 
culture chamber/slid (commercially availabl from 
LabTek) . Th £ llowing day w added tat-E2R fusion 
protein or unfused tat protein to the culture m dium at 
5 1 mg/ml, with 0.1 mM chloroguine to inhibit lysosomal 
protease activity. Six hours later we observed 
immunofluorescence according to the following 
procedure. 

We removed the culture medium and washed the 
10 cells twice with phosphate buffered saline ("PBS") . We 
fixed the cells by treatment with 3.5% formaldehyde at 
room temperature, and permeabilized the cells by 
treatment with a solution of 0.2% Triton X-100, 2% 
bovine serum albumin ("BSA") in PBS containing 1 mM 
15 MgCl, and 0.1 mM caCl, which solution is designated 

St 

PBS+. We blocked the cells by treatment with whole 
goat serum (Cappel cat. no. 5006-1380) diluted 1:30 
with PBS+ containing 2% BSA, for 1 hour at 4°C. We 
added primary antibody at a 1:100 dilution in PBS+ 

20 containing 2% BSA for 1 hour at 4°C, and then we added 
secondary antibody at a 1:100 dilution in 0.2% Tween- 
20, 2% BSA, in PBS+ for 30 minutes at 4°C. We washed 
the slides with 0.2% Tween-20, 2% BSA in PBS+, and then 
mounted in 50% glycerol in PBS. For viewing slide 

25 preparations, we used a fluorescent microscope with a 
rhodamine filter. 

With the tat antibody in the above procedure, 
we observed intense internal fluorescence in cells 
exposed to tat-E2R protein, and in positive control 

30 cells exposed to non-fused tat protein. The E2 
antibody gave intracellular fluorescence in cells 
exposed to the tat-E2R fusion protein, but not in cells 
exposed to unfused tat protein. No internal 
fluorescence appear d in negative controls that were 

35 not exposed to the tat-E2R fusion protein or the 
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unfused tat protein. In additional experimental 
c ntrols, neither antibody r suited in significant 
intracellular fluorescence in cells to which an E2 
repressor alone was added. The intensity and 
5 subcellular location of fluorescence was similar 

whether the tat protein or the tat-E2R fusion prot in 
was added to the cells. This indicates that the tat- 
E2R fusion protein entered the cells as efficiently as 
tat protein. These results indicate that the 

10 antibodies were specific for their respective proteins, 
and that the tat protein can deliver the E2R prot in 
into animal cells. An indication of the efficiency of 
the tat- induced uptake is that the intracellular 
fluorescence was far more intense when tat-E2R was 

15 added to cells than when the tat~E2R or E2R gene 
constructs were expressed in transfected cells. We 
have obtained similar results in tests with other forms 
of tat and shorter forms of E2 repressors. 

Microorganisms and recombinant DMA molecules 

20 prepared by the processes of this invention are 
exemplified by cultures deposited in the In Vitro 
International, Inc. culture collection ("IVI"), in 
Linthicum, Maryland on January 18, 1991 and identified 



25 314: fij. coli DH5/pXB314 

337L: £*. coli DH5/pEC337L 

339M: £i coli DH5/pEC339M 

340F: IL. coli DH5/pEC340F 

340R: Ej. coli DH5/pEC340R 

30 340Y: E*. soli DH5/pEC340Y 

344L: Ei. coli DH5/pEC344L. 

« 

These cultures were assigned accession numbers IVI 

10262, IVI 10263, IVI 10264 , IVI 10265, IVI 10266, 
IVI 10267 and IVI 10268, respectively. These cultures 
35 were subsequently transferred from IVI to the American 
Type Culture Collection ( W ATCC M ) in Rockville, 
Maryland, on June 20, 1991. They were assigned ATCC 



* 
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accession numbers 68740, 68735, 68736, 68737, 68738, 
68739 and 68740, respectively. 

Microorganisms and recombinant DNA molecules 
prepared by the processes of this invention are further 
5 exemplified by cultures deposited in the American Type 
Culture Collection, Rockville, Maryland on January 24, 
1992 and identified as: 

pHE2-123 
PHE2-85 

10 BL21(DE3) /pLYSS/pFTE501 

These cultures were assigned ATCC numbers 68896, 68897 
and 68898, respectively. 

While ve have hereinbefore described a number 
of embodiments of this invention, it is apparent that 

15 our basic constructions may be altered to provide other 
embodiments which utilize the processes and products of 
this invention. Therefore, it will be appreciated that 
the scope of this invention is to be defined by th 
claims appended hereto, rather than by the specific 

20 embodiments which have been presented hereinbefore by 
way of example. 

In the following "Sequence Listing", we have 
provided nucleotide sequence and amino acid sequence 
information for the SEQ ID Numbers referred to in this 

25 application. It should be noted that SEQ ID Numbers 2, 
4, 6, 8, 10, 12, 14, 16, 23, 25 and 27 repeat the amino 
acid sequences listed with the nucleotide sequenc s of 
SEQ ID Numbers 1, 3, 5, 7, 9, 11, 13, 15, 22, 24 and 
26, respectively. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: BIOGEN, INC. 

NEW ENGLAND MEDICAL CENTER HOSPITALS , INC. 
BARSOUM, James G. (US only) 
ANDROPHY, Elliot J. (US only) 

(il) TITLE OF INVENTION: PAPILLOMAVIRUS E2 TRANS -ACTIVATION 
REPRESSORS 

(ill) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: FISH & NEAVE 

(B) STREET: 875 Third Avenue 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10022-6250 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Haley Jr., James F. 

(B) REGISTRATION NUMBER: 27,794 

(C) REFERENCE/DOCKET NUMBER: B156CIP 

(tx) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212)715-0600 

(B) TELEFAX: (212)715-0673 

(C) TELEX: 14-8367 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bovine papillomavirus 

(B) STRAIN: Type 1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .378 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCG GTG GAC TTG GCA TCA AGG CAG GAA GAA GAG GAG CAG TCG CCC GAG 48 
Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
15 10 15 

TCC AC A GAG GAA GAA CCA GTG ACT CTC CCA AGG CGC ACC ACC AAT GAT 96 
Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

GGA TTC CAC CTG TTA AAG GCA GGA GGG TCA TGC TTT GCT CTA ATT TCA 144 
Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu He Ser 
35. 40 45 

GGA ACT GCT AAC CAG GTA AAG TGC TAT CGC TTT CGG GTG AAA AAG AAC 192 
Gly Thr Ala Asn Gin Val Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

CAT AGA CAT CGC TAC GAG AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT 240 
His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

GAC AAC GGT GCT GAA AGA CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT 288 
Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin He Leu He Thr Phe 

85 90 95 

GGA TCG CCA AGT CAA AGG CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT 336 
Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 

CCT GGA ATG AAC ATT TCC GGC TTT ACA GCC AGC TTG GAC TTC 378 
Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 • 120 125 



TGA 



381 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
15 10 15 

Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu lie Ser 
35 40 45 

Gly Thr Ala Asn Gin Val Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin He Leu He Thr Phe 

85 90 95 

Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 

Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 



(2) INFORMATION FOR SEQ ID N0:3: 

<i) SEQUENCE CHARACTERISTICS,; 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bovine papillomavirus 

(B) STRAIN: Type 1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..378 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCG GTG GAC TTG GCA TCA AGG CAG GAA GAA GAG GAG CAG TCG CCC GAC 48 
Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
15 10 15 

TCC ACA GAG GAA GAA CCA GTG ACT CTC CCA AGG CGC ACC ACC AAT GAT 96 
Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

GGA TTC CAC CTG TTA AAG GCA GGA GGG TCA TGC TTT GCT CTA ATT TCA 144 
Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu He Ser 
35 40 45 

GGA ACT GCT AAC CTG GTA AAG TGC TAT CGC TTT CGG GTG AAA AAG AAC 192 
Gly Thr Ala Asn Leu Val Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

CAT AGA CAT CGC TAC GAG AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT 240 
Hts Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

GAC AAC GGT GCT GAA AGA CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT 288 
Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin He Leu He Thr Phe 

85 90 95 

GGA TCG CCA AGT CAA AGG CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT 336 
Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 

CCT GGA ATG AAC ATT TCC GGC TTT ACA GCC AGC TTG GAC TTC 378 
Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 

TGA 381 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
1 ^ 5 10 15 



Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 
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Gly Ph His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu lie Ser 
35 40 45 

Gly Thr Ala Asn Leu Val Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin He Leu He Thr Phe 

85 90 95 

Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 



Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..378 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCG GTG GAC TTG GCA TCA AGG CAG GAA GAA GAG GAG CAG TCG CCC GAC 48 
Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
1 5 10 15 

TCC ACA GAG GAA GAA CCA GTG ACT CTC CCA AGG CGC ACC ACC AAT GAT 96 
Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

GGA TTC CAC CTG TTA AAG GCA GGA GGG TCA TGC TTT GCT CTA ATT TCA 144 
Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu He Ser 
35 40 45 

GGA ACT GCT AAC CAG GTA ATG TGC TAT CGC TTT CGG GTG AAA AAG AAC 192 
Gly Thr Ala Asn Gin Val Met Cys Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 
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CAT AGA CAT CGC TAC GAG AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT 
His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 



240 



GAC AAC GGT GCT GAA AGA CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT 
Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin lie Leu He Thr Ph 

85 90 95 



288 



GGA TCG CCA AGT CAA AGG CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT 
Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 



336 



CCT GGA ATG AAC ATT TCC GGC TTT ACA GCC AGC TTG GAC TTC 
Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 



378 



TGA 



381 



(2) INFORMATION FOR SEQ ID NO: 6: 

4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
1 5 10 15 



Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu lie Ser 
35 AO 45 



Gly Thr Ala Asn Gin Val Met Cys Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin He Leu He Thr Phe 

85 90 95 

Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 



Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 
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(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucl ic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bovine papillomavirus 

(B) STRAIN: Type 1 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .378 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCG GTG GAC TTG GCA TCA AGG CAG GAA GAA GAG GAG CAG TCG CCC GAC 48 
Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
1 5 10 15 

TCC ACA GAG GAA GAA CCA GTG ACT CTC CCA AGG CGC ACC ACC AAT GAT 96 
Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

GGA TTC CAC CTG TTA AAG GCA GGA GGG TCA TGC TTT GCT CTA ATT TCA 144 
Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu lie Ser 
35 40 45 

GGA ACT GCT AAC CAG GTA AAG TTC TAT CGC TTT CGG GTG AAA AAG AAC 192 
Gly Thr Ala Asn Gin Val Lys Phe Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

CAT AGA CAT CGC TAC GAG AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT 240 

His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 
* 

GAC AAC GGT GCT GAA AGA CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT 288 

Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin He Leu He Thr Phe 

85 . 90 95 



GGA TCG CCA AGT CAA AGG CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT 
Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 



336 
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CCT GGA ATG AAC ATT TCC GGC TTT ACA GCC AGC TTG GAC TTC 378 
Pro Gly Met Asn lie S r Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 



TGA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
15 10 15 

Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu lie Ser 
35 40 45 

Gly Thr Ala Asn Gin Val Lys Phe Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin lie Leu lie Thr Phe 

85 90 95 

Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 

Pro Gly Met Asn lie Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 9: 

• 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



381 
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(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bovine papillomavirus 

(B) STRAIN: Type 1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..378 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCG GTG GAC TTG GCA TCA AGG CAG GAA GAA GAG GAG CAG TCG CCC GAC 48 
Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
15 10 15 

TCC ACA GAG GAA GAA CCA GTG ACT CTC CCA AGG CGC ACC ACC AAT GAT 96 
Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

GGA TTC CAC CTG TTA AAG GCA GGA GGG TCA TGC TTT GCT CTA ATT TCA 144 
Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu lie Ser 
35 40 45 

GGA ACT GCT AAC CAG GTA AAG CGC TAT CGC TTT CGG GTG AAA AAG AAC 192 
Gly Thr Ala Asn Gin Val Lys Arg Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

CAT AGA CAT CGC TAC GAG AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT 240 
His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

GAC AAC GGT GCT GAA AGA CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT 288 
Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin lie Leu He Thr Phe 

85 90 95 

GGA TCG CCA AGT CAA AGG CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT 336 
Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 

T 

CCT GGA ATG AAC ATT TCC GGC TTT ACA GCC AGC TTG GAC TTC 378 
Pro Gly Met Asn lie Ser Gly Phe Thr Ala Ser Leu Asp P*e 
115 120 v 125 

TGA 381 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
1 5 10 15 

Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu lie Ser 
35 40 45 

Gly Thr Ala Asn Gin Val Lys Arg Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

* 

His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin lie Leu lie Thr Phe 

85 90 95 

Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 

Pro Gly Met Asn lie Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 



(2) INFORMATION FOR SEQ ID NO; 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bovine papillomavirus 

(B) STRAIN: Type 1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..378 
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* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CCG GTG GAC TTG GCA TCA AGG CAG GAA GAA GAG GAG CAG TCG CCC GAC 48 
Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
15 10 15 

TCC ACA GAG GAA GAA CCA GTG ACT CTC CCA AGG CGC ACC ACC AAT GAT 96 
Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

GGA TTC CAC CTG TTA AAG GCA GGA GGG TCA TGC TTT GCT CTA ATT TCA 144 
Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu lie Ser 
35 40 45 

GGA ACT GCT AAC CAG GTA AAG TAC TAT CGC TTT CGG GTG AAA AAG AAC 192 
Gly Thr Ala Asn Gin Val Lys Tyr Tyr Arg Phe Arg Val Lys Lys Asn 
50 55 60 

CAT AGA CAT CGC TAC GAG AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT 240 
His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 

GAC AAC GGT GCT GAA AGA CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT 288 
Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin lie Leu lie Thr Phe 

85 90 95 

GGA TCG CCA AGT CAA AGG CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT 336 
Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 

CCT GGA ATG AAC ATT TCC GGC TTT ACA GCC AGC TTG GAC TTC 378 
Pro Gly Met Asn lie Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 

TGA ~ 381 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
1 5 10 15 

Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 
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Gly Phe His Leu Leu Lys Ala Gly .Gly Ser Cys Phe Ala Leu He Ser 

35 40 45 

Gly Thr Ala Asn Gin Val Lys Tyr Tyr Arg Phe Arg Val Lys Lys Asn 

50 55 60 

His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 

65 70 75 80 

Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin He Leu He Thr Phe 

85 90 • 95 

Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 

Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 

115 * 120 125 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ID MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bovine papillomavirus 

(B) STRAIN: Type 1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..378 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCG GTG GAC TTG GCA TCA AGG CAG GAA GAA GAG GAG CAG TCG CCC GAC 48 

Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 

1 5 10 15 

TCC ACA GAG GAA GAA CCA GTG ACT CTC CCA AGG CGC ACC ACC AAT GAT 96 

Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

GGA TTC CAC CTG TTA AAG GCA GGA GGG TCA TGC TTT GCT CTA ATT TCA 144 

Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu He Ser 

35 40 45 
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GGA ACT GCT AAC CAG GTA AAG TGC TAT CGC TTT CTG GTG AAA AAG AAC 
Gly Thr Ala Asn Gin Val Lys Cys Tyr Arg Phe Leu Val Lys Lys Asn 
50 55 60 



192 



CAT AGA CAT CGC TAG GAG AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT 
His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 
65 70 75 80 



240 



GAC AAC GGT GCT GAA AGA CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT 
Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin He Leu He Thr Phe 

85 ~ 90 95 



288 



GGA TCG CCA AGT CAA AGG CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT 
Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 



336 



CCT GGA ATG AAC ATT TCC GGC TTT ACA GCC AGC TTG GAC TTC 
Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 " 120 125 



378 



TGA 



381 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Val Asp Leu Ala Ser Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp 
1 5 10 15 

Ser Thr Glu Glu Glu Pro Val Thr Leu Pro Arg Arg Thr Thr Asn Asp 

20 25 30 

Gly Phe His Leu Leu Lys Ala Gly Gly Ser Cys Phe Ala Leu He Ser 
35 40 45 

Gly Thr Ala Asn Gin Val Lys Cys Tyr Arg Phe Leu Val Lys Lys Asn 
50 55 60 



His Arg His Arg Tyr Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala 

65 70 75 80 

Asp Asn Gly Ala Glu Arg Gin Gly Gin Ala Gin lie Leu lie Thr Phe 

85 " 90 95 



Gly Ser Pro Ser Gin Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro 

100 105 110 
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Pro Gly Met Asn He Ser Gly Phe Thr Ala Ser Leu Asp Phe 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bovine papillomavirus 

(B) STRAIN: Type 1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..219 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GTA AAG TGC TAT CGC TTT CGG GTG AAA AAG AAC CAT AGA CAT CGC TAC 48 
Val Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn His Arg His Arg Tyr 
15 10 15 

GAG AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT GAC AAC GGT GCT GAA 96 
Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Ala Glu 

20 25 30 

AGA CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT GGA TCG CCA AGT CAA 144 
Arg Gin Gly Gin Ala Gin He Leu He Thr Phe Gly Ser Pro Ser Gin 
35 40 45 

AGG CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT CCT GGA ATG AAC ATT 192 
Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro Pro Gly Met Asn He 
50 '55 60 

TCC GGC TTT ACA GCC AGC TTG GAC TTC TGA 222 
Ser Gly Phe Thr Ala Ser Leu Asp Phe 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Val Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn His Arg His Arg Tyr 
15 10 15 

Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Ala Glu 

20 25 30 

Arg Gin Gly Gin Ala Gin He Leu He Thr Phe Gly Ser Pro Ser Gin 
35 .40 45 

Arg Gin Asp Phe Leu Lys His Val Pro Leu Pro Pro Gly Met Asn He 
50 55 60 

Ser Gly Phe Thr Ala Ser Leu Asp Phe 
65 70 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AGCAACTAGT CCCAAG 16 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTCGAGAAGC TTGACGGATC CG 22 
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(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



TGCAGAGCTC TTCGAACTGC CTAGGCTTAA 



30 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GATCCTTTGC CGCCAC 16 



(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GAAACGGCGG TGGTAC 16 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) 



FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..255 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATG GCT AGC AAC ACT ACA CCC ATA GTA CAT TTA AAA GGT GAT GCT AAT 48 
Met Ala Ser Asn Thr Thr Pro He Val His Leu Lys Gly Asp Ala Asn 
1 5 10 15 

ACT TTA AAA TGT TTA AGA TAT AGA TTT AAA AAG CAT TGT ACA TTG TAT 96 
Thr Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys Thr Leu Tyr 

20 25 30 

ACT GCA GTG TCG TCT ACA TGG CAT TGG ACA GGA CAT AAT GTA AAA CAT 144 
Thr Ala Val Ser Ser Thr Trp His Trp Thr Gly His Asn Val Lys His 
35 40 45 

AAA AGT GCA ATT GTT ACA CTT ACA TAT GAT AGT GAA TGG CAA CGT GAC 192 
Lys Ser Ala He Val Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Asp 
50 55 60 

CAA TTT TTG TCT CAA GTT AAA ATA CCA AAA ACT ATT ACA GTG TCT ACT 240 
Gin Phe Leu Ser Gin Val Lys He Pro Lys Thr He Thr Val Ser Thr 
65 70 75 80 

GGA TTT ATG TCT ATA TGA 258 
Gly Phe Met Ser He 

85 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala Ser Asn Thr Thr Pro He Val His Leu Lys Gly Asp Ala Asn 
15 10 15 

Thr Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys Thr Leu Tyr 

20 25 3^ 

a 

Thr Ala Val Ser Ser Thr Trp His Trp Thr Gly His Asn Val Lys His 
35 40 45 

Lys Ser Ala He Val Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Asp 
50 55 60 

Gin Phe Leu Ser Gin Val Lys He Pro Lys Thr He Thr Val Ser Thr 
65 70 75 80 



Gly Phe Met Ser He 

85 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .369 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ATG GTA CCA GAC ACC GGA AAC CCC TGC CAC ACC ACT AAG TTG TTG CAC 48 
Met Val Pro Asp Thr Gly Asn Pro Cys His Thr Thr Lys Leu Leu His 
1 5 10 15 

AGA GAC TCA GTG GAC AGT GCT CCA ATC CTC ACT GCA TTT AAC AGC TCA 96 
Arg Asp Ser Val Asp Ser Ala Pro lie Leu Thr Ala Phe Asn Ser Ser 

20 25 30 

CAC AAA GGA CGG ATT AAC TGT AAT AGT AAC ACT ACA CCC ATA GTA CAT 144 
His Lys Gly Arg lie Asn Cys Asn Ser Asn Thr Thr Pro lie Val His 
35 40 45 

TTA AAA GGT GAT GCT AAT ACT TTA AAA TG$ TTA AGA TAT AGA TTT AAA 192 
Leu Lys Gly Asp Ala Asn Thr Leu Lys Cys Leu Arg Tyr Arg Phe Lys 
50 55 60 

AAG CAT TGT ACA TTG TAT ACT GCA GTG TCG TCT ACA TGG CAT TGG ACA 240 
Lys His Cys Thr Leu Tyr Thr Ala Val Ser Ser Thr Trp His Trp Thr 
65 70 75 80 

GGA CAT AAT GTA AAA CAT AAA AGT GCA ATT GTT ACA CTT ACA TAT GAT 288 
Gly His Asn Val Lys His Lys Ser Ala He Val Thr Leu Thr Tyr Asp 

85 90 95 

AGT GAA TGG CAA CGT GAC CAA TTT TTG TCT CAA GTT AAA ATA CCA AAA 336 
Ser Glu Trp Gin Arg Asp Gin Phe Leu Ser Gin Val Lys He Pro Lys 

100 105 110 

ACT ATT ACA GTG TCT ACT GGA TTT ATG TCT ATA TGA 372 
Thr He Thr Val Ser Thr Gly Phe Met Ser He 
115 120 



(2) INFORMATION FOR SEQ ID N0:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Val Pro Asp Thr Gly Asn Pro Cys His Thr Thr Lys Leu Leu His 
15 10 15 

Arg Asp Ser Val Asp Ser Ala Pro He Leu Thr Ala Phe Asn Ser Ser 

20 25 30 

His Lys Gly Arg He Asn Cys Asn Ser Asn Thr Thr Pro He Val His 
35 40 45 

Leu Lys Gly Asp Ala Asn Thr Leu Lys Cys Leu Arg Tyr Arg Phe Lys 
50 55 60 

Lys His Cys Thr Leu Tyr Thr Ala Val Ser Ser Thr Trp His Trp Thr 
65 70 75 80 

Gly His Asn Val Lys His Lys Ser Ala He Val Thr Leu Thr Tyr Asp 

85 90 95 

Ser Glu Trp Gin Arg Asp Gin Phe Leu Ser Gin Val Lys He Pro Lys 

100 105 110 

Thr He Thr Val Ser Thr Gly Phe Met Ser He 
115 120 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 1..936 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ATG GAA CCG GTC GAC CCG CGT CTG GAA CCA TGG AAA CAC CCC GGG TCC 48 
Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 
1 5 10 15 

CAG CCG AAA ACC GCG TGC ACC AAC TGC TAC TGC AAA AAA TGC TGC TTC 96 
Gin Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe 

20 25 30 
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CAC TGC CAG GTT TGC TTC ATC ACC AAA GCC CTA GGT ATC TCT TAC GGC 
Hts Cys Gin Val Cys Phe He Thr Lys Ala Leu Gly He Ser Tyr Gly 
35 40 45 



144 



CGT AAA AAA CGT CGT CAG CGA CGT CGT CCG CCG CAG GGA TCT TCC ATG 
Arg Lys Lys Arg Arg Gin Arg Arg Arg Pro Pro Gin Gly Ser Ser Met 
50 55 60 



192 



GCC GGT GCT GGA CGC ATT TAC TAT TCT CGC TTT GGT GAC GAG GCA GCC 
Ala Gly Ala Gly Arg He Tyr Tyr Ser Arg Phe Gly Asp Glu Ala Ala 
65 70 75 80 

AGA TTT AGT ACA ACA GGG CAT TAC TCT GTA AGA GAT CAG GAC AGA GTG 
Arg Phe Ser Thr Thr Gly His Tyr Ser Val Arg Asp Gin Asp Arg Val 

85 90 95 



240 



288 



TAT GCT GGT GTC TCA TCC ACC TCT TCT GAT TTT AGA GAT CGC CCA GAC 
Tyr Ala Gly Val Ser Ser Thr Ser Ser Asp Phe Arg Asp Arg Pro Asp 

100 105 110 



336 



GGA GTC TGG GTC GCA TCC GAA GGA CCT GAA GGA GAC CCT GCA GGA AAA 
Gly Val Trp Val Ala Ser Glu Gly Pro Glu Gly Asp Pro Ala Gly Lys 
115 120 125 



384 



GAA GCC GAG CCA GCC CAG CCT GTC TCT TCT TTG CTC GGC TCC CCC GCC 
Glu Ala Glu Pro Ala Gin Pro Val Ser Ser Leu Leu Gly Ser Pro Ala 
130 135 140 



432 



TGC GGT CCC ATC AGA GCA GGC CTC GGT TGG GTA CGG GAC GGT CCT CGC 
Cys Gly Pro He Arg Ala Gly Leu Gly Trp Val Arg Asp Gly Pro Arg 
145 150 155 160 



480 



TCG CAC CCC TAC AAT TTT CCT GCA GGC TCG GGG GGC TCT ATT CTC CGC 
Ser His Pro Tyr Asn Phe Pro Ala Gly Ser Gly Gly Ser He Leu Arg 

165 170 ' 175 



528 



TCT TCC TCC ACC CCG GTG CAG GGC ACG GTA CCG GTG GAC TTG GCA TCA 
Ser Ser Ser Thr Pro Val Gin Gly Thr Val Pro Val Asp Leu Ala Ser 

180 185 190 



576 



AGG CAG GAA GAA GAG GAG CAG TCG CCC GAC TCC ACA GAG GAA GAA CCA 
Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp Ser Thr Glu Glu Glu Pro 
195 200 205 



624 



GTG ACT CTC CCA AGG CGC ACC ACC AAT GAT GGA TTC CAC CTG TTA AAG 
Val Thr Leu Pro Arg Arg Thr Thr Asn Asp Gly Phe His Leu Leu Lys 
210 215 220 



672 



GCA GGA GGG TCA TGC TTT GCT CTA ATT TCA GGA ACT GCT AAC CAG GTA 
Ala Gly Gly Ser Cys Phe Ala Leu He Ser Gly Thr Ala Asn Gin Val 
225 230 235 240 



720 



AAG TGC TAT CGC TTT CGG GTG AAA AAG AAC CAT AGA CAT CGC TAC GAG 
Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn His Arg His Arg Tyr Glu 

245 * 250 255 



768 



t 
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AAC TGC ACC ACC ACC TGG TTC ACA GTT GCT GAC AAC GGT GCT GAA AGA 
Asn Cys Thr Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Ala Glu Arg 

260 265 270 



816 



CAA GGA CAA GCA CAA ATA CTG ATC ACC TTT GGA TCG CCA AGT CAA AGG 
Gin Gly Gin Ala Gin lie Leu He Thr Phe Gly Ser Pro Ser Gin Arg 
275 280 285 



864 



CAA GAC TTT CTG AAA CAT GTA CCA CTA CCT CCT GGA ATG AAC ATT TCC 
Gin Asp Phe Leu Lys His Val Pro Leu Pro Pro Gly Met Asn He Ser 
290 295 300 



912 



GGC TTT ACA GCC AGC TTG GAC TTC TGA 
Gly Phe Thr Ala Ser Leu Asp Phe 
305 310 



939 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 
15 10 15 

Gin Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe 

20 25 30 

His Cys Gin Val Cys Phe He Thr Lys Ala Leu Gly He Ser Tyr Gly 
35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Pro Pro Gin Gly Ser Ser Met 
50 55 .60 

Ala Gly Ala Gly Arg He Tyr Tyr Ser Arg Phe Gly Asp Glu Ala Ala 
65 70 75 80 

Arg Phe Ser Thr Thr Gly His Tyr Ser Val Arg Asp Gin Asp Arg Val 

85 90 95 

Tyr Ala Gly Val Ser Ser Thr Ser Ser Asp Phe Arg Asp Arg Pro Asp 

100 105 110 

Gly Val Trp Val Ala Ser Glu Gly Pro Glu Gly Asp Pro Ala Gly Lys 
115 120 125 



Glu Ala Glu Pro Ala Gin Pro Val Ser Ser Leu Leu Gly Ser Pro Ala 
130 135 140 
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Cys Gly Pro II Arg Ala Gly Leu Gly Trp Val Arg Asp Gly Pro Arg 
145 150 155 160 

Ser His Pro Tyr Asn Ph Pro Ala Gly Ser Gly Gly Ser lie Leu Arg 

165 170 175 

» 

Ser Ser Ser Thr Pro Val Gin Gly Thr Val Pro Val Asp Leu Ala Ser 

180 185 190 

Arg Gin Glu Glu Glu Glu Gin Ser Pro Asp Ser Thr Glu Glu Glu Pro 
195 200 205 

Val Thr Leu Pro Arg Arg Thr Thr Asn Asp Gly Phe His Leu Leu Lys 
210 215 220 

Ala Gly Gly Ser Cys Phe Ala Leu He Ser Gly Thr Ala Asn Gin Val 
225 230 235 240 

Lys Cys Tyr Arg Phe Arg Val Lys Lys Asn His Arg His Arg Tyr Glu 

245 250 255 

Asn Cys Thr Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Ala Glu Arg 

260 265 270 

Gin Gly Gin Ala Gin He Leu He Thr Phe Gly Ser Pro Ser Gin Arg 
275 280 285 

Gin Asp Phe Leu Lys His Val Pro Leu Pro Pro Gly Met Asn He Ser 
290 295 300 

Gly Phe Thr Ala Ser Leu Asp Phe 
305 310 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

■ 

CTCCCATGGC TAGCAACACT ACACCC 26 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CTCCCATGGT ACCAGACACC GGAAACC 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGGGGATCCT CATATAGACA TAAATCC 



9 
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CLAIMS 

1. An E2 trans -activation repress r 
comprising a polypeptide having an amino acid s guence 
homologous to the native E2 DNA binding domain (SEQ ID 
MO: 1) , said polypeptide being capable of forming 
inactive heterodimers with native E2 protein and said 
inactive heterodimers being incapable of binding to E2 
DNA binding sites. 

2 . An E2 trans -activation repressor 
comprising a polypeptide fragment of the native E2 DNA 
binding domain, said fragment being capable of forming 
inactive heterodimers with native E2 protein and said 
inactive heterodimers being incapable of binding to E2 
DNA binding sites. 

3. The E2 trans -activation repressor 
according to claim l f said repressor consisting 
essentially of a polypeptide having an amino acid 
sequence homologous to the native E2 DNA binding domain 
(SEQ ID N0:l), said polypeptide being capable of 
forming inactive heterodimers with native E2 protein 
and said inactive heterodimers being incapable of 
binding to E2 DNA binding sites. 

4. The E2 trans -activation repressor 
according to claim 2, said repressor consisting 
essentially of a polypeptide fragment *of the native E2 
DNA binding domain, said fragment being capable of 
forming inactive heterodimers with native E2 protein 
and said inactive heterodimers being incapable of 
binding to E2 DNA binding sites. 

5. The E2 trans -activation repressor 
according to claim 1, wherein said amino acid s guence 
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is selected fr id the gr up consisting of the amino acid 
sequence d fined by SEQ ID NO: 3, the amin acid 
sequence defined by SEQ ID HO: 5, the amino acid 
sequence defined by SEQ ID NO: 7, the amino acid 
sequence defined by SEQ ID NO: 9, the amino acid 
sequence defined by SEQ ID NO: 11, the amino acid 
sequence defined by SEQ ID NO: 13, the amino acid 
sequence defined by SEQ ID NO: 23 and the amino acid 
sequence defined by SEQ ID NO: 25. 

6. The E2 ££aa£-activation repressor 
according to claim 2 , wherein said repressor consists 
essentially of a polypeptide of the formula selected 
from the group consisting of: 

a) AA 338 to AA 41(J of the native BPV1 E2 
protein (SEQ ID NO: 15); and 

b) ^283 to ^365 of tiie native 
HPV16 E2 protein (SEQ ID NO:23). 

7. An E2 tcass-activation repressor 
comprising a non-peptide moiety, said moiety being 
capable of specifically forming inactive complexes with 
full-length native E2 polypeptides. 

8. A DNA sequence encoding an E2 trans - 
activation repressor having an amino acid sequence 
homologous to the native E2 DNA binding domain (SEQ ID 
N0:1), said repressor being capable of forming inactive 
heterodimers with native E2 protein and said inactive 
heterodimers being incapable of binding to E2 DNA 
binding sites, wherein said DNA sequence comprises a 
DNA sequence selected from the group consisting of: 

a) SEQ ID NO: 3; 

b) SEQ ID NO: 5; 

c) SEQ ID NO: 7; 
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d) SEQ ID NO: 9; 

e) SEQ ID NO: 11; 

f) SEQ ID NO: 13; 

g) SEQ ID NO:22; 

h) SEQ ID NO: 24; and 

i) DNA sequences which encode the 



amino acid sequence encoded by any one of the foregoing 
DNA sequences. 



activation repressor comprising a polypeptide fragment 
of the native E2 DNA binding domain, said fragment 
being capable of forming inactive heterodimers with 
native E2 protein and said inactive heterodimers being 
incapable of binding to E2 DNA binding sites, wh rein 
said DNA sequence comprises a DNA sequence selected 
from the group consisting of: 

a) DNA sequences which are a fragment 
of the native E2 DNA binding domain and 

b) DNA sequences which encode the 
amino acid sequence encoded by any one of the foregoing 
DNA sequences. 



10. The DNA sequence according to claim 8, 



selected from the group consisting of the DNA inserts 

Of: pEC337L, pEC339M, pEC340F, pEC340R, pEC340Y, 
pEC344L, pHE2-85 and pHE2-123. 



11. A recombinant DNA molecule comprising a 
DNA sequence according to claim 8 or 9. 



A DNA sequence encoding an E2 trans - 



12. The recombinant DNA molecule according 
to claim 11, wherein said DNA sequence is op ratively 
linked to an expression control sequence. 
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13. The recombinant DMA mol cul according 
to claim 12, wherein the expr ssi n control sequence is 
selected from the group consisting of the early and 
late promoters of SV40, adenovirus or cytomegalovirus 
immediate early promoter, the l?c system, the ££fi 
system, the £&£ or TRC system, T7 promoter whose 
expression is directed by T7 RNA polymerase, the major 
operator and promoter regions of phage A, the control 
regions of fd coat protein, the promoter for 
3-phosphoglycerate kinase or other glycolytic enzymes, 
the promoters of acid phosphatase, the promoters of th 
yeast a-mating factors, the polyhedron promoter of the 
baculo virus system and other sequences known to control 
the expression of genes of prokaryotic or eukaryotic 
cells or their viruses, and combinations thereof. 

14 . The recombinant DNA molecule according 
to claim 13, said molecule being selected from the 
group consisting of pEC337L, pEC339M, pEC340F, pEC340R, 
PEC340Y, pEC344L, pHE2-85 and pHE2-123. 

15. A unicellular host transformed with a 
recombinant DNA molecule according to any one of 
claims 11 to 14. 

16. The transformed host according to 

• ■ t. 

claim 15, wherein said unicellular host is selected 
'from the group consisting of E.coli . Pseudomonas , 
Bacillus , Streptomyces Saccharomyces and other fungi, 
plant cells in culture, insect cells in culture, animal 
cells iii culture and human cells in culture „ 

17. The transformed host according to claim . 
16, wherein said host is selected from the group 
consisting of E.coli DH5/pEC337L, E-coli DH5/pEC339M, 
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E.coli DH5/pEC340F, E.coli DH5/pEC340R, E.coli 
DH5/pEC340Y, E.C li DH5/pEC344L, E.coli pHE2-85, E.coli 
pHE2-123 and E.coli pFTESOl. 

18. A process for producing an E2 trans - 
activation repressor, said process comprising the st ps 
of: 

a) culturing a transformed host 
according to claim 15; and 

b) recovering the E2 trans -activation 
repressor from said culture. 

19. The E2 tcans-activation repressor 
according to claim 1 or 2, said repressor further 
comprising a lipophilic moiety. 

20. The E2 trans -activation repressor 
according to claim 19, wherein said lipophilic moiety 
is a fatty acid molecule. 

21. The E2 trans -activation repressor 
according to claim 1 or 2, said repressor further 
comprising a basic polymer. 

22. The E2 trans -activation repressor 
according to claim 21, wherein said polymer is selected 
from the group consisting of polyarginine and 

poly lysine. 

23. An E2 trans -activation repressor fusion 
protein comprising an E2 trans -activation repressor 
according to any one of claims 1 to 3 and a transport 
moiety. 
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24. The E2 t rans -act i va t ion repressor fusion 
protein according to claim 23, wherein said transport 
moiety is selected from the group consisting of 
bacterial hemolysins, cell entry components of 
bacterial toxins, viral receptors, cell receptors, cell 
ligands, bacterial immunogens, parasitic immunogens, 
viral immunogens, immunoglobulins or fragments thereof 
that bind to target molecules, cytokines, growth 
factors, colony stimulating factors, tat protein and 
hormones . 

25. The E2 trans -activation repressor fusion 
protein according to claim 23 or 24, wherein the amino 
terminus of the E2 repressor is fused to the carboxy 
terminus of the transport moiety, 

26. A pharmaceutically acceptable 
composition for treating papillomavirus infection 
comprising a pharmaceutically effective amount of an E2 
trans -activation repressor according to any one of 
claims 1 to 3 and a pharmaceutically acceptable 
carrier. 

27. A pharmaceutically acceptable 
composition for treating papillomavirus infection 
comprising an E2 trans -activation repressor fusion 
protein according to claim 23 or 24 and a 
pharmaceutically acceptable carrier. 

28. A pharmaceutically acceptable 
composition for treating papillomavirus infection 
comprising an E2 trans -activation repressor according 
to claim 7 and a pharmaceutically acceptable carrier. 
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29. A method for tr ating papillomavirus 
inf ction in a patient comprising th step of 
administering to the patient a pharmaceutically 
effective amount of a pharmaceutically acceptabl 
composition according to any one of claims 26 to 28. 

30. The method according to claim 29, 
wherein said pharmaceutically effective amount is 
between about 1 and 1000 mg/kg body weight per day when 
said composition is administered parenterally. 

31. The method according to claim 29 , 
wherein said pharmaceutically effective amount is 
between about 1 and 1000 ng/ml when said compositi n is 
administered topically. 

32. A method for treating papillomavirus- 
infected cells comprising the step of introducing into 
said cells an E2 trans -activation repressor according 
to any one of claims 1, 2, 3 or 7. 

33. A method for treating papillomavirus- 
infected cells comprising the step of introducing into 
said cells a DNA sequence according to any one of 
claims 8 to 10. 



34. The method 
wherein said DNA sequence 
by treating said cells to 

35. The method 
wherein said DNA sequence 
by electroporation. 



according to claim 33, 

is introduced into said cells 

allow uptake of said DNA. 

according to claim 33, 

is introduced into said c lis 
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36. The method according to claim 33 , 
wherein said DKA sequenc is introduced into said cells 
by direct injection. 

37. The method according to claim 33, 
wherein said DNA sequence is introduced into said cells 
by means of a defective recombinant virus. 

38. The method according to claim 37, said 
method comprising the steps of: 

a) transcribing said DNA sequence into 
an RNA sequence; and 

b) incorporating said RNA sequence 
into a defective recombinant retrovirus . 

39. A method for isolating mutations 
yielding E2 homologous polypeptides that repress trans - 
activation by native E2 proteins comprising the steps 
of: 

a) producing a population of mutant 
DNA fragments by mutagenizing a DNA fragment encoding 
the amino acids of the native minimal DNA binding 
domain of a papillomavirus E2 protein; 

b) operatively linking said population 
of DNA fragments to expression control sequences; 

c) inserting said population of DNA 
fragments operatively. linked to expression control 
sequences into a vector; 

d) transforming a unicellular host 
with said vector, said host comprising a full-length 
native E2 gene operatively linked to expression control 
sequences and a reporter gene whose transcription is 
activated by dimers comprising full-length native E2 
proteins; 




A 
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) scr ening for clon s of transformed 

h sts that do not display E2 Jyan&-activation; 

f ) testing E2 homologous polyp ptid s 
from said clones for their ability to dimerize with 
themselves; 

g) testing said E2 homologous 
polypeptides for their ability to bind to E2 DMA 
binding sites; 

h) testing said E2 homologous 
polypeptides for their ability to repress 
transcriptional activation by full-length native E2 
protein in a eukaryotic cell* 

40. The use of a polypeptide comprising the 
E2 protein dimerization region to design E2 trans- 
activation repressors. 

41. The use of a polypeptide consisting 
essentially of the E2 protein dimerization region to 
design E2 trans -act i va t ion repressors. 
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